Developmental mutations in zebrafish

ABSTRACT

The invention features novel zebrafish nucleic acid and amino acid molecules, and zebrafish containing mutations in important developmental genes. In addition, the invention features the use of these nucleic acid and amino acid molecules in methods of diagnosing, preventing, and treating a variety of mammalian diseases and developmental disorders. Furthermore, zebrafish mutant for a nucleic acid or amino acid molecule of the invention may be used in screens for compounds that modulate the development of an organism as a whole or of specific tissues or organs within an organism. In particular, the present invention features novel nucleic acid sequences involved, e.g., in kidney development and kidney disorders. Mutations in these sequences, for example, result in the formation of cysts in the kidney.

CROSS REFERENCE TO RELATED APPLICATIONS

[0001] This application claims the benefit of U.S. Provisional Application Serial No. 60/368,760, filed Mar. 29, 2002, the disclosure of which is hereby incorporated by reference.

FIELD OF THE INVENTION

[0002] The field of the invention is developmental diseases and disorders.

BACKGROUND OF THE INVENTION

[0003] Genetic screens have been the most successful approach for identifying genes required for developmental processes. Applied on a sufficiently large scale, a genetic screen can identify all of the genes which, when mutated one at a time, impact the phenotype of interest. Notably, genetic screens are relatively unbiased since no assumptions about the genes involved in the biological processes of interest need to be made and, thus, such screens can reveal novel genetic pathways underlying important phenotypes.

[0004] It has long been recognized in the art that genetic screens in vertebrate animals would be highly informative and would help identify many new genes required for the development of vertebrate organs and structures and such genetic screens have been carried out in mice and zebrafish (Rinchik, Trends Genet. 7:15-21, 1991; Driever et al., Development 123:37-46, 1996; Haffter et al., Development 123:1-36, 1996). Genetic screens using chemical mutagens in zebrafish suggested that there are only about 2,400 genes that can be mutated to yield a visible phenotype affecting the development of a fertilized egg to a free-swimming five-day old larva (Haffter et al., Development 123:1-36, 1996). In accordance to what was observed in invertebrates, these results suggest that a small number of genes may be essential for the development of a viable vertebrate (Driever et al., Development 123:37-46, 1996; Haffter et al., Development 123:1-36, 1996; Dove, Genetics 116:5-8, 1987). In other respects, however, genetic screens in vertebrates have not been nearly as informative as those in invertebrates. The difficulty of performing large enough screens to identify all the genes required for a specific biological process, and the difficulty of rapidly cloning mutated genes from vertebrate genomes contribute to this shortcoming. Nevertheless, the small-scale screens that have been performed in zebrafish and mice hint at the vast potential of this approach.

[0005] Simply identifying mutant phenotypes in a genetic screen can be informative by revealing both the kinds of phenotypes that can occur and the number of genes involved in the process of interest. In zebrafish, simple visual screens of embryos in the first 5 days after fertilization can reveal mutations in genes essential for the normal development of most of the major organ systems, including the nervous system, heart, blood, gut, liver, kidney, jaws, eyes, and ears. However, to understand how genes specify a biological process, it is essential to identify the mutated genes responsible for the phenotypes.

[0006] Insertional mutagenesis, when compared to chemical mutagenesis, greatly speeds cloning the mutated gene. The integration of exogenous DNA sequences into a genome can be mutagenic, and simplifies cloning of the mutated genes since the inserted DNA serves as a tag to aid in isolating the flanking DNA sequence. Previously, insertional mutagens, including DNA viruses as well as retroviruses, have been used successfully in Drosophila and mice. For example, mouse retroviral vectors pseudotyped with a VSV-G envelope were found to be able to infect the fish germ line following injection of virus into blastula-stage embryos at the 1000 to 2000-cell stage. In addition, retroviruses were attractive candidates for insertional mutagens, because they had been shown to integrate into many different sites in mammalian and avian chromosomes and to be effective mutagens in mice. Importantly, they integrate without rearrangement of their own sequences or significant alterations to host DNA sequences at the site of insertion, essential features for easily cloning genes disrupted by insertions. Applying this approach to zebrafish to identify and clone genes important in zebrafish development is desirable and is likely to provide significant insights into many aspects of vertebrate development and, thereby, aid in our understanding, diagnosis, and treatment of a variety of diseases, including ones that affect humans.

SUMMARY OF THE INVENTION

[0007] The invention features novel zebrafish nucleic acid and amino acid molecules, and zebrafish containing mutations in important developmental genes. In addition, the invention features the use of these nucleic acid and amino acid molecules in methods of diagnosing, preventing, and treating a variety of mammalian diseases and developmental disorders. Furthermore, zebrafish mutant for a nucleic acid or amino acid molecule of the invention may be used in screens for compounds that modulate the development of an organism as a whole or of specific tissues or organs within an organism. In particular, the present invention features novel nucleic acid sequences involved, e.g., in kidney development and kidney disorders. Mutations in these sequences, for example, result in the formation of cysts in the kidney.

[0008] Accordingly, the first aspect of the invention features an isolated nucleic acid molecule, e.g., a mouse, human, or zebrafish nucleic acid moleucle, including a nucleic acid sequence having at least 75% nucleic acid sequence identity to the 459 nucleic acid sequence of SEQ ID NO:59 over at least 600 contiguous nucleic acids, where this nucleic acid molecule functions in kidney development. In desirable embodiments of this aspect, the nucleic acid sequence includes the sequence of SEQ ID NO:59. In addition, the nucleic acid molecule may include a viral insertion at a nucleotide corresponding to nucleotide 210 of SEQ ID NO:59. Desirably, the invention features a zebrafish containing a viral insertion at a nucleotide corresponding to nucleotide 210 of SEQ ID NO:59 Further desirable embodiments are vector including the isolated nucleic acid molecule of this aspect of the invention and a cell including this vector.

[0009] In another aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 75% sequence identity to the 459 amino acid sequence of SEQ ID NO:60 over at least 250 contiguous amino acids, where this polypeptide functions in kidney development. Desirably, the polypeptide includes the sequence of SEQ ID NO:60.

[0010] The invention also features a method of treating or preventing a kidney disorder in an organism. This method includes the step of contacting the organism with a therapeutically effective amount of the nucleic acid of the first aspect of the invention, or its complement, where the nucleic acid is sufficient to elicit an alteration in expression of a 459 nucleic acid sequence in the organism, and where the alteration in the level of expression treats or prevents a kidney disorder. The nucleic acidmolecule used in this method can be a cDNA or an mRNA molecule and the contacting can result in an increase in expression of the polypeptide encoded by a nucleic acid sequence including the sequence of SEQ ID NO:59. Alternatively, the nucleic acid molecule used in this method can be a double-stranded RNA molecule and the contacting can lead to a decrease in expression, or inhibits biological activity, of a nucleic acid sequence including SEQ ID NO:59. Further, the nucleic acid molecule used in this method may be an anti-sense RNA molecule, and the contacting can lead to a decrease in expression, or inhibition of biological activity, of a nucleic acid sequence including the sequence of SEQ ID NO:59.

[0011] The invention also features method for diagnosing a kidney disorder or the propensity to develop a kidney disorder in an organism. This method includes detecting an alteration in the level of 459 polypeptide expression in a sample derived from a first organism and comparing the level of expression to that of a 459 polypeptide in a sample derived from a second, control organism, where an alteration in the level of expression or activity of the 459 polypeptide in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a kidney disorder. Desirably, the 459 polypeptide includes the amino acid sequence of SEQ ID NO:60.

[0012] Further, the invention features another method for diagnosing a kidney disorder or the propensity to develop a kidney disorder in an organism. This method involves detecting an alteration in the sequence, or a fragment of the sequence, of a 459 nucleic acid molecule in a sample derived from a first organism and comparing the sequence to that of a 459 nucleic acid molecule derived from a second, control organism, where an alteration of the 459 sequence in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a kidney disorder. In desirable embodiments of this aspect of the invention, the alteration is a decrease in the expression or activity of a nucleic acid molecule including the sequence of SEQ ID NO:59 or, alternatively, the alteration is an increase in the expression or activity of a nucleic acid molecule including the sequence of SEQ ID NO:59.

[0013] Another aspect, the invention features an isolated nucleic acid molecule including a sequence having at least 85% nucleic acid sequence identity to the 904 nucleic acid sequence of SEQ ID NO:1 over at least 500 contiguous nucleic acids, where this nucleic acid molecule functions in neural tissue proliferation, Central Nervous System (CNS) development, or vascular development. In addition, this isolated nucleic acid sequence may include a mutation, for example, a viral insertion, such as, one at a nucleotide corresponding to nucleotide 1315 of SEQ ID NO:1.

[0014] In a further aspect, the invention features an isolated nucleic acid molecule including a sequence having at least 85% nucleic acid sequence identity to the U2AF nucleic acid sequence of SEQ ID NO:7 over at least 600 contiguous nucleic acids, where this nucleic acid molecule functions in brain or tectum development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 46 and 47 of SEQ ID NO:7.

[0015] In a further aspect, the invention features an isolated nucleic acid molecule including a sequence having at least 40% nucleic acid sequence identity to the 954 nucleic acid sequence of SEQ ID NO:9 over at least 500 contiguous nucleic acids, where this nucleic acid molecule functions in cartilage development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one at a nucleotide corresponding to nucleotide 432 or 506 of SEQ ID NO:9.

[0016] In a further aspect, the invention features an isolated nucleic acid molecule including a sequence having at least 90% nucleic acid sequence identity to the V-ATPase Alpha Subunit nucleic acid sequence of SEQ ID NO:15 over at least 250 contiguous nucleic acids, where this nucleic acid molecule functions in eye or body pigmentation. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one at a nucleotide corresponding to nucleotide 169 of SEQ ID NO:15.

[0017] In a further aspect, the invention features an isolated nucleic acid molecule including a sequence having at least 85% nucleic acid sequence identity to the V-ATPase SFD Subunit nucleic acid sequence of SEQ ID NO:17 over at least 600 contiguous nucleic acids, where this nucleic acid molecule functions in eye or body pigmentation. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 31 and 32 of SEQ ID NO:17.

[0018] In a further aspect, the invention features an isolated nucleic acid molecule including a sequence having at least 90% nucleic acid sequence identity to the V-ATPase-16 kDa Proteolytic Subunit nucleic acid sequence of SEQ ID NO:19 over at least 400 contiguous nucleic acids, where this nucleic acid molecule functions in body or eye pigmentation or touch sensitivity. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 242 and 243 of SEQ ID NO:19.

[0019] In a further aspect, the invention features an isolated nucleic acid molecule including a sequence having at least 75% nucleic acid sequence identity to the 1463 nucleic acid sequence of SEQ ID NO:157 over at least 200 contiguous nucleic acids, where this nucleic acid molecule functions in body pigmentation, brain development, or vascular development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 389 and 390 of SEQ ID NO:157.

[0020] In an further aspect, the invention features an isolated nucleic acid molecule including a sequence having at least 90% nucleic acid sequence identity to the VPSP18 nucleic acid sequence of SEQ ID NO:21 over at least 75 contiguous nucleic acids, where this nucleic acid molecule functions in pigmentation, photoreceptor, retinal, or brain development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one at a nucleotide corresponding to nucleotide 2336 of SEQ ID NO:21.

[0021] In a further aspect, the invention features an isolated nucleic acid molecule including a sequence having at least 85% nucleic acid sequence identity to the 60S Ribosomal Protein L35 nucleic acid sequence of SEQ ID NO:25 over at least 350 contiguous nucleic acids, where this nucleic acid molecule functions in head, eye, or brain development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 30 and 31 of SEQ ID NO:25.

[0022] In a further aspect, the invention features an isolated nucleic acid molecule including a sequence having at least 85% nucleic acid sequence identity to the 215 nucleic acid sequence of SEQ ID NO:31 over at least 200 contiguous nucleic acids, where this nucleic acid molecule functions in eye or jaw development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 294 and 295 of SEQ ID NO:31.

[0023] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 95% nucleic acid sequence identity to the 307 nucleic acid sequence of SEQ ID NO:33 over at least 50 contiguous nucleic acids, where this nucleic acid molecule functions in jaw or cartilage development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one at a nucleotide corresponding to nucleotide 176 of SEQ ID NO:33.

[0024] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 40% nucleic acid sequence identity to the 572 nucleic acid sequence of SEQ ID NO:35, where this nucleic acid functions in jaw or cartilage development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one at a nucleotide corresponding to nucleotide 277 of SEQ ID NO:35.

[0025] In a further aspect, this invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 42% nucleic acid sequence identity to the 1116A nucleic acid sequence of SEQ ID NO:37 over at least 1000 contiguous nucleic acids, where this nucleic acid molecule functions in jaw development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one at a nucleotide corresponding to nucleotide 135 of SEQ ID NO:37.

[0026] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity over at least 1000 contiguous nucleic acids to the 1548 nucleic acid sequence of SEQ ID NO:39, where this nucleic acid molecule functions in eye, head, heart, limb, jaw, or neurocranium development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one at a nucleotide corresponding to nucleotide 85 of SEQ ID NO:39.

[0027] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 85% nucleic acid sequence identity to Casein Kinase 1α nucleic acid sequence of SEQ ID NO:41 over at least 1000 contiguous nucleic acids, where this nucleic acid molecule functions in jaw, limb, or cartilage development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 730 and 731 of SEQ ID NO:41.

[0028] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity to the 429 nucleic acid sequence of SEQ ID NO:47 over at least 200 contiguous nucleic acids, where this nucleic acid molecule functions in vascular, liver, gall bladder, pancreas, or gut development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 182 and 183 of SEQ ID NO:47.

[0029] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 85% nucleic acid sequence identity to the 428 nucleic acid sequence of SEQ ID NO:49 over at least 200 contiguous nucleic acids, where this nucleic acid molecule functions in muscle or brain development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one at a nucleotide corresponding to nucleotide 187 of SEQ ID NO:49.

[0030] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 85% nucleic acid sequence identity to the Spinster nucleic acid sequence of SEQ ID NO:51 over at least 100 contiguous nucleic acids, where this nucleic acid molecule functions in infertility disorders. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one at a nucleotide corresponding to a nucleotide approximately 2 to 5 kb upstream of nucleotide 209 of SEQ ID NO:51.

[0031] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 85% nucleic acid sequence identity to the Kinesin-Related Motor Protein EGS nucleic acid sequence of SEQ ID NO:57 over at least 600 nucleic acids, where this nucleic acid molecule functions in cell death regulation or body development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 50 and 51 of SEQ ID NO:57.

[0032] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 75% nucleic acid sequence identity to the 459 nucleic acid sequence of SEQ ID NO:59 over at least 600 contiguous nucleic acids, where this nucleic acid molecule functions in cell death regulation in the CNS or in body development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one at a nucleotide corresponding to nucleotide 210 of SEQ ID NO:59.

[0033] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity to the Vesicular Integral-Membrane Protein VIP 36 nucleic acid sequence of SEQ ID NO:65 over at least 300 contiguous nucleic acids, where this nucleic acid molecule functions in touch sensitivity. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as one between nucleotides corresponding to nucleotides 219 and 220 of SEQ ID NO:65.

[0034] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 85% nucleic acid sequence identity to the 299 nucleic acid sequence of SEQ ID NO:67 over at least 100 contiguous nucleic acids, where this nucleic acid molecule functions in brain or eye apoptosis, or in jaw, cartilage, or limb development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 47 and 48 of SEQ ID NO:67.

[0035] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 40% nucleic acid sequence identity to the 994 nucleic acid sequence of SEQ ID NO:69 over at least 500 contiguous nucleic acids, where this nucleic acid molecule functions in eye, head, jaw, cartilage, or stomach development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 66 and 67 of SEQ ID NO:69.

[0036] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 85% nucleic acid sequence identity to the 1373 nucleic acid sequence of SEQ ID NO:71 over at least 350 contiguous nucleic acids, where this nucleic acid molecule functions in brain, eye, or body development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 118 and 119 of SEQ ID NO:71.

[0037] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity to the Denticleless nucleic acid sequence of SEQ ID NO:73 over at least 200 contiguous nucleic acids, where this nucleic acid molecule functions in CNS, body, somite, yolk sac, or eye development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 307 and 308 of SEQ ID NO:73.

[0038] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 35% nucleic acid sequence identity to the Telomeric Repeat Factor 2 nucleic acid sequence of SEQ ID NO:79 over at least 250 contiguous nucleic acids, where this nucleic acid molecule functions in brain or eye development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 529 and 530 of SEQ ID NO:79.

[0039] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity to the SIL nucleic acid sequence of SEQ ID NO:81 over at least 100 contiguous nucleic acids, where this nucleic acid molecule functions in brain, head, or body development, or in motility. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 273 and 274 of SEQ ID NO:81.

[0040] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 85% nucleic acid sequence identity to the U1 Small Nuclear Ribonucleoprotein C nucleic acid sequence of SEQ ID NO:83 over at least 200 contiguous nucleic acids, where this nucleic acid molecule functions in body, eye, hindbrain, ear, pigment, or limb development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 52 and 53 of SEQ ID NO:83.

[0041] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity to Ski Interacting Protein nucleic acid sequence of SEQ ID NO:85 over at least 850 contiguous nucleic acids, where this nucleic acid molecule functions in brain or body development, or motility. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one at a nucleotide corresponding to a nucleotide approximately 1.2 kb upstream of nucleotide 19 of SEQ ID NO:85.

[0042] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 85% nucleic acid sequence identity to the 297 nucleic acid sequence of SEQ ID NO:87 over at least 200 contiguous nucleic acids, where this nucleic acid molecule functions in motility, cartilage, jaw, eye, tail, or brain development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one at a nucleotide corresponding to nucleotide 74 of SEQ ID NO:87.

[0043] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity to the TCP-1 Complex Gamma Chain nucleic acid sequence of SEQ ID NO:89 over at least 1500 contiguous nucleic acids, where this nucleic acid molecule functions in yolk sac development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 75 and 76 of SEQ ID NO:89.

[0044] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 90% nucleic acid sequence identity to the Small Nuclear Ribonucleoprotein D1 nucleic acid sequence of SEQ ID NO:91 over at least 200 contiguous nucleic acids, where this nucleic acid molecule functions in CNS or eye development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 76 and 77 of SEQ ID NO:91.

[0045] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity to the DNA Polymerase Epsilon Subunit B nucleic acid sequence of SEQ ID NO:93 over at least 1050 contiguous nucleic acids, where this nucleic acid molecule functions in brain or eye development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one at a nucleotide corresponding to nucleotide 929 of SEQ ID NO:93 or one between nucleotides corresponding to nucleotides 1161 and 1162 of SEQ ID NO:93.

[0046] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity to the 821-02 nucleic acid sequence of SEQ ID NO:95 over at least 100 contiguous nucleic acids, where this nucleic acid molecule functions in cell death regulation in the CNS or the eye. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 231 and 232 or 369 and 370 of SEQ ID NO:95.

[0047] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity to the 1045 nucleic acid sequence of SEQ ID NO:97 over at least 600 contiguous nucleic acids, where this nucleic acid molecule functions in brain or head development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 216 and 344 of SEQ ID NO:97.

[0048] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 75% nucleic acid sequence identity to the 1055-1 nucleic acid sequence of SEQ ID NO:99 over at least 600 contiguous nucleic acids, where this nucleic acid molecule functions in cell cycle progression or 60S ribosomal subunit biogenesis. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 167 and 168 of SEQ ID NO:99.

[0049] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity to the Spliceosome Associated Protein 49 nucleic acid sequence of SEQ ID NO:101 over at least 700 contiguous nucleic acids, where this nucleic acid molecule functions in tectal or body development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 53 and 54 of SEQ ID NO:101.

[0050] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity to the DNA Replication Licensing Factor MCM7 nucleic acid sequence of SEQ ID NO:103 over at least 300 contiguous nucleic acids, where this nucleic acid molecule functions in eye or CNS development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one at a nucleotide corresponding to nucleotide 198 of SEQ ID NO:103, or one between nucleotides corresponding to nucleotides 121 and 122 of SEQ ID NO:103.

[0051] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 85% nucleic acid sequence identity to the Dead-Box RNA Helicase (DEAD5 or DEAD19) nucleic acid sequence of SEQ ID NO:105 over at least 900 contiguous nucleic acids, where this nucleic acid molecule functions in brain development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one at a nucleotide corresponding to nucleotide 132 of SEQ ID NO:105.

[0052] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity to the 1581 nucleic acid sequence of SEQ ID NO:107 over at least 200 contiguous nucleic acids, where this nucleic acid molecule functions in head or eye development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 346 and 347 of SEQ ID NO:107.

[0053] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 90% nucleic acid sequence identity to the Imitation Switch (ISWI)/SNF2 nucleic acid sequence of SEQ ID NO:111 over at least 200 contiguous nucleic acids, where this nucleic acid molecule functions in development of the visual system, brain, jaw or cartilage. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one at a nucleotide corresponding to nucleotide 76 of SEQ ID NO:111.

[0054] In a further aspect, this invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity to the Chromosomal Assembly Protein C (XCAP-C) nucleic acid sequence of SEQ ID NO:113 over at least 600 contiguous nucleic acids, where this nucleic acid molecule functions in optic tectum, eye, or hindbrain development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 181 and 182 of SEQ ID NO:113.

[0055] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity to the DNA Replication Licensing Factor MCM2 nucleic acid sequence of SEQ ID NO:115 over at least 1200 contiguous nucleic acids, where this nucleic acid molecule functions in eye, optic tectum, jaw, or cartilage development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one at a nucleotide in an corresponding to a nucleotide in an intron preceding nucleotide 399 of SEQ ID NO:115.

[0056] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity to the DNA Replication Licensing Factor MCM3 nucleic acid sequence of SEQ ID NO:117 over at least 600 contiguous nucleic acids, where this nucleic acid molecule functions in optic tectum, head, or eye development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one at a nucleotide corresponding to nucleotide 50 of SEQ ID NO:117, or one between nucleotides corresponding to nucleotides 75 and 76 of SEQ ID NO:117.

[0057] In a further aspect, the invention features an isolated nucleic acid molecule including a Valyl-tRNA Synthase nucleic acid sequence of SEQ ID NO:119, where this nucleic acid molecule functions in cell death regulation, head or eye development, or eye pigmentation. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 30 and 31 of SEQ ID NO:119.

[0058] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 85% nucleic acid sequence identity to the 405 Ribosomal Protein S5 nucleic acid sequence of SEQ ID NO:121 over at least 600 contiguous nucleic acids, where this nucleic acid molecule functions in brain development or motility. In addition, thisisolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 31 and 32 of SEQ ID NO:121.

[0059] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity to the TCP-1 Beta nucleic acid sequence of SEQ ID NO:123 over at least 1500 contiguous nucleic acids, where this nucleic acid molecule functions in jaw, cartilage, head, or eye development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 63 and 64 of SEQ ID NO:123.

[0060] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity to the TCP-1 Eta nucleic acid sequence of SEQ ID NO:125 over at least 1600 contiguous nucleic acids, where this nucleic acid molecule functions in head, brain, or eye development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 32 and 33 of SEQ ID NO:125.

[0061] In a further aspect, the invention features an isolated nucleic acid molecule including the Translation Elongation Factor eEF1 Alpha nucleic acid sequence of SEQ ID NO:127, where this nucleic acid molecule functions in cell death regulation in the head or eye. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 60 and 61 of SEQ ID NO:127.

[0062] In a further aspect, this invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 40% identity to the 1257 nucleic acid sequence of SEQ ID NO:129 over at least 500 contiguous nucleic acids where this nucleic acid molecule functions in head, eye, or jaw development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one at a nucleotide corresponding to nucleotide 175 of SEQ ID NO:129.

[0063] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 85% nucleic acid sequence identity to the 60S Ribosomal Protein L24 nucleic acid sequence of SEQ ID NO:131 over at least 400 contiguous nucleic acids, where this nucleic acid molecule functions in head or eye development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 144 and 145 of SEQ ID NO:131.

[0064] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity to the Non-Muscle Adenylosuccinate Synthase nucleic acid sequence of SEQ ID NO:133 over at least 350 contiguous nucleic acids, where this nucleic acid molecule functions in cell death regulation, or in jaw or cartilage development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 217 and 218 of SEQ ID NO:133, or one at a nucleotide corresponding to nucleotide 209 of SEQ ID NO:133.

[0065] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 75% nucleic acid sequence identity to the Nuclear Cap Binding Protein Subunit 2 nucleic acid sequence of SEQ ID NO:135 over at least 450 contiguous nucleic acids, where this nucleic acid molecule functions in head, eye, CNS, jaw, cartilage, or stomach development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 137 and 138 of SEQ ID NO:135.

[0066] In a further aspect, the invention features an isolated nucleic acid molecule including the Ornithine Decarboxylase nucleic acid sequence of SEQ ID NO:137, where this nucleic acid molecule functions in jaw or cartilage development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 97 and 98 of SEQ ID NO:137.

[0067] In a further aspect, this invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity to the full length Protein Phosphatase 1 Nuclear Targeting Subunit nucleic acid sequence of SEQ ID NO:139 over at least 250 contiguous nucleic acids, where this nucleic acid molecule functions in head, jaw, body, or gut development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one at a nucleotide corresponding to nucleotide 303 of SEQ ID NO:139.

[0068] In a further aspect, this invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity to the Mitochondrial Inner Membrane Translocating (rTIM23) nucleic acid sequence of SEQ ID NO:141 over at least 450 contiguous nucleic acids, where this nucleic acid molecule functions in eye pigmentation or in the development of the vascular system. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one at a nucleotide corresponding to nucleotide 100 of SEQ ID NO:141.

[0069] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity to the 1447 nucleic acid sequence of SEQ ID NO:143 over at least 950 contiguous nucleic acids, where this nucleic acid molecule functions in pancreas, tail, stomach, cartilage, or limb development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 227 and 228 of SEQ ID NO:143.

[0070] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity to the ARS2 nucleic acid sequence of SEQ ID NO:145 over at least 650 contiguous nucleic acids, where this nucleic acid molecule functions in pigment, tectum, jaw, or ear development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 103 and 104 of SEQ ID NO:145.

[0071] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 80% nucleic acid sequence identity to the BAF53a nucleic acid sequence of SEQ ID NO:149 over at least 1300 contiguous nucleic acids, where this nucleic acid molecule functions in brain, body, or eye development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one at a nucleotide corresponding to nucleotide 160 of SEQ ID NO:149.

[0072] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 85% nucleic acid sequence identity to the Histone Deacetylase nucleic acid sequence of SEQ ID NO:151 over at least 1300 contiguous nucleic acids, where this nucleic acid molecule functions in heart, eye, ear, semicircular canal, jaw, limb, or cartilage development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 98 and 99 of SEQ ID NO:151, or at a nucleotide corresponding to nucleotide 88 of SEQ ID NO:151.

[0073] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 85% nucleic acid sequence identity to the Fibroblast Isoform of the ADP/ATP Carrier Protein nucleic acid sequence of SEQ ID NO: 153 over at least 900 contiguous nucleic acids, where this nucleic acid molecule functions in lung development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 178 and 179 of SEQ ID NO:153.

[0074] In a further aspect, the invention features an isolated nucleic acid molecule including a nucleic acid sequence having at least 75% nucleic acid sequence identity to the TAFII-55 nucleic acid sequence of SEQ ID NO:155 over at least 600 contiguous nucleic acids, where this nucleic acid molecule functions in head, eye, or lung development. In addition, this isolated nucleic acid molecule may include a mutation, for example, a viral insertion, such as, one between nucleotides corresponding to nucleotides 107 and 108 of SEQ ID NO:155.

[0075] In a further aspect, the invention features an isolated nucleic acid molecule including a Neurogenin Related Protein-1 nucleic acid sequence of SEQ ID NO:11, where this nucleic acid molecule further includes a viral insertion at a nucleotide corresponding to nucleotide 1149 of SEQ ID NO:11.

[0076] In a sixty fourth aspect, the invention features an isolated nucleic acid molecule including a Cad-1 nucleic acid sequence of SEQ ID NO:13, where this nucleic acid molecule further includes a viral insertion between nucleotides corresponding to nucleotides 583 and 584 of SEQ ID NO:13.

[0077] In a further aspect, the invention features an isolated nucleic acid molecule including a CopZ1 nucleic acid sequence of SEQ ID NO:29, where this nucleic acid molecule further includes a viral insertion between nucleotides corresponding to nucleotides 90 and 91 of SEQ ID NO:29.

[0078] In a further aspect, the invention features an isolated nucleic acid molecule including a Ribonucleotide Reductase Protein R1 Class 1 nucleic acid sequence of SEQ ID NO:55, where this nucleic acid molecule further includes a viral insertion between nucleotides corresponding to nucleotides 147 and 148 of SEQ ID NO:55.

[0079] In a further aspect, the invention features an isolated nucleic acid molecule including an Aryl Hydrocarbon Receptor Nuclear Transporter 2A nucleic acid sequence of SEQ ID NO:63, where this nucleic acid molecule further includes a viral insertion at a nucleotide corresponding to nucleotide 229 or 240 of SEQ ID NO:63.

[0080] In a further aspect, the invention features an isolated nucleic acid molecule including a Ribonucleotide Reductase Protein R2 nucleic acid sequence of SEQ ID NO:75, where this nucleic acid molecule further includes a viral insertion at a nucleotide corresponding to nucleotide 137 of SEQ ID NO:75, or to a nucleotide corresponding to nucleotide 337 or 342 of GenBank Accession No. AW280665.

[0081] In a further aspect, the invention features an isolated nucleic acid molecule including a TCP-1 Alpha nucleic acid sequence of SEQ ID NO:77, where this nucleic acid molecule further includes a viral insertion between nucleotides corresponding to nucleotides 130 and 131 of SEQ ID NO:77, or to a nucleotide corresponding to nucleotide 140 bp upstream of nucleotide 64 of SEQ ID NO:77.

[0082] In a further aspect, the invention features an isolated nucleic acid molecule including a Cyclin A2 nucleic acid sequence of SEQ ID NO:109, where this nucleic acid molecule further includes a viral insertion at a nucleotide corresponding to nucleotide 374 or 401 of SEQ ID NO:109.

[0083] In a further aspect, the invention features an isolated nucleic acid molecule including a Sec61 Alpha nucleic acid sequence of SEQ ID NO:147, where this nucleic acid molecule further includes a viral insertion between nucleotides corresponding to nucleotides 132 and 133 of SEQ ID NO:147.

[0084] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 95% sequence identity to the 904 amino acid sequence of SEQ ID NO:2 over at least 160 contiguous amino acids, where this polypeptide functions in CNS development or neural tissue proliferation.

[0085] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 95% sequence identity to the U2AF amino acid sequence of SEQ ID NO:8 over at least 250 contiguous amino acids, where this polypeptide functions in brain or tectum development.

[0086] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 95% sequence identity to the 954 amino acid sequence of SEQ ID NO:10, where this polypeptide functions in cartilage development.

[0087] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 80% sequence identity to the V-ATPase Alpha Subunit amino acid sequence of SEQ ID NO:16 over at least 226 contiguous amino acids, where this polypeptide functions in body or eye pigmentation.

[0088] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 90% sequence identity to the V-ATPase SFD Subunit amino acid sequence of SEQ ID NO:18 over at least 450 contiguous amino acids, where this polypeptide functions in body or eye pigmentation.

[0089] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 95% sequence identity to the V-ATPase 16 kDa Proteolytic Subunit amino acid sequence of SEQ ID NO:20 over at least 150 contiguous amino acids, where this polypeptide functions in body or eye pigmentation, or touch sensitivity.

[0090] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 50% sequence identity to the 1463 amino acid sequence of SEQ ID NO:158 over at least 475 contiguous amino acids, where this polypeptide functions in brain development, body pigmentation, or vascular development.

[0091] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 70% sequence identity to the VPSP18 amino acid sequence of SEQ ID NO:22, over at least 550 contiguous amino acids, where this polypeptide functions in pigmentation, photoreceptor, retina, or tectum development.

[0092] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 95% sequence identity to the 60S Ribosomal Protein L35 amino acid sequence of SEQ ID NO:26 over at least 100 contiguous amino acids, where this polypeptide functions in head, eye, or brain development.

[0093] In a further aspect, the invention features an isolated polypeptide including a sequence having at least 80% sequence identity to the 215 amino acid sequence of SEQ ID NO:32 over at least 529 contiguous amino acids, where this polypeptide functions in eye or jaw development.

[0094] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 60% sequence identity to the 307 amino acid sequence of SEQ ID NO:34 over at least 200 contiguous amino acids, where this polypeptide functions in jaw or cartilage development.

[0095] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 40% sequence identity to the 572 amino acid sequence of SEQ ID NO:36 over at least 200 contiguous amino acids, where this amino acid functions in jaw or cartilage development.

[0096] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 45% sequence identity to the 1116A amino acid sequence of SEQ ID NO:38 over at least 200 contiguous amino acids, where this polypeptide functions in jaw development.

[0097] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 76% sequence identity to the 1548 amino acid sequence of SEQ ID NO:40 over at least 950 contiguous amino acids, where this polypeptide functions in eye, head, heart, limb, jaw, or neurocranium development.

[0098] In a further aspect, the invention features an isolated polypeptide including a Casein Kinase 1α amino acid sequence of SEQ ID NO:42, where this polypeptide functions in jaw, limb, or cartilage development.

[0099] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 55% sequence identity to the 429 amino acid sequence of SEQ ID NO:48 over at least 750 contiguous amino acids, where this polypeptide functions in vascular, liver, gall bladder, pancreas, or gut development.

[0100] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 65% sequence identity to the 428 amino acid sequence of SEQ ID NO:50 over at least 175 contiguous amino acids, where this polypeptide functions in brain or muscle development.

[0101] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 70% sequence identity to the Spinster amino acid sequence of SEQ ID NO:52 over at least 500 contiguous amino acids, where this polypeptide functions in fertility.

[0102] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 60% amino acid sequence identity to the Kinesin-Related Motor Protein EG5 amino acid sequence of SEQ ID NO:58 over at least 900 contiguous amino acids, where this polypeptide functions in cell death regulation or body development.

[0103] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 75% sequence identity to the 459 amino acid sequence of SEQ ID NO:60 over at least 250 contiguous amino acids, where this polypeptide functions in body development or cell death regulation.

[0104] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 65% sequence identity to the Vesicular Integral-Membrane Protein VIP 36 amino acid sequence of SEQ ID NO:66 over at least 320 contiguous amino acids, where this polypeptide functions in touch sensitivity.

[0105] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 50% sequence identity to the 299 amino acid sequence of SEQ ID NO:68 over at least 500 contiguous amino acids, where this polypeptide functions in cell death regulation in the eye or brain, or in jaw, cartilage, or limb development.

[0106] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 40% sequence identity to the 994 amino acid sequence of SEQ ID NO:70 over at least 500 contiguous amino acids, where this polypeptide functions in eye, head, jaw, cartilage, or stomach development.

[0107] In a further aspect, the invention features an isolated polypeptide including at least 110 contiguous amino acids of the 1373 amino acid sequence of SEQ ID NO:72, where this polypeptide functions in brain, eye, or body development.

[0108] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 70% sequence identity to the Denticleless amino acid sequence of SEQ ID NO:74 over at least 400 contiguous amino acids, where this amino acid functions in CNS, body, yolk sac, somite, or eye development.

[0109] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 35% sequence identity to the Telomeric Repeat Factor 2 amino acid sequence of SEQ ID NO:80 over at least 200 contiguous amino acids, where this polypeptide functions in brain or eye development.

[0110] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 40% sequence identity to the SIL amino acid sequence of SEQ ID NO:82 over at least 1200 contiguous amino acids, where this polypeptide functions in brain, head, or body development, or in motility.

[0111] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 87% sequence identity to the U1 the Small Nuclear Ribonucleoprotein C polypeptide of SEQ ID NO:84 over at least 150 contiguous amino acids, where this polypeptide functions in body, eye, hindbrain, ear, pigment, or limb development.

[0112] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 87% sequence identity to the Ski Interacting Polypeptide amino acid sequence of SEQ ID NO:86 over at least 500 contiguous amino acids, where this polypeptide functions in body or brain development, or in motility.

[0113] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 80% sequence identity to the 297 amino acid sequence of SEQ ID NO:88 over at least 600 contiguous amino acids, where this polypeptide functions in motility, cartilage, cranium, eye, tail, or brain development.

[0114] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 90% sequence identity to the TCP-1 Complex Gamma Chain amino acid sequence of SEQ ID NO:90 over at least 500 contiguous amino acids, where this polypeptide functions in yolk sac development.

[0115] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 99% sequence identity to Small Nuclear Ribonucleoprotein D1 amino acid sequence of SEQ ID NO:92, where this polypeptide functions in CNS or eye development.

[0116] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 76% sequence identity to the DNA polymerase Epsilon Subunit B amino acid sequence of SEQ ID NO:94 over at least 500 contiguous amino acids, where this polypeptide functions in brain or eye development.

[0117] In a further aspect, the invention features an isolated polypeptide including a amino acid sequence having at least 60% sequence identity over the full length of the 821-02 amino acid sequence of SEQ ID NO:96; where this polypeptide functions in cell death regulation in the CNS or the eye.

[0118] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 78% sequence identity to the 1045 amino acid sequence of SEQ ID NO:98 over at least 300 contiguous amino acids, where this polypeptide functions in brain or head development.

[0119] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 75% sequence identity to the 1055-1 amino acid sequence of SEQ ID NO:100 over at least 300 contiguous amino acids, where this polypeptide functions in cell cycle progression or 60S ribosomal subunit biogenesis.

[0120] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 85% sequence identity to the Spliceosome Associated Protein 49 polypeptide of amino acid sequence SEQ ID NO:102 over at least 350 contiguous amino acids, where this polypeptide functions in tectal or body development.

[0121] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 80% sequence dentity to the DNA Replication Licensing factor MCM7 amino acid sequence of SEQ ID NO:104 over at least 190 contiguous amino acids, where this polypeptide functions in eye or CNS development.

[0122] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 85% sequence identity to the DEAD-Box RNA Helicase (DEAD5 or DEAD19) amino acid sequence of SEQ ID NO:106 over at least 450 contiguous amino acids, where this polypeptide functions in brain development.

[0123] In a further aspect, the invention features an isolated polypeptide including a amino acid acid sequence having at least 50% sequence identity to the 1581 amino acid sequence of SEQ ID NO:108 over at least 300 contiguous amino acids, where this polypeptide functions in head or eye development.

[0124] In a further aspect, the invention features an isolated polypeptide including an amino acid acid sequence having at least 75% sequence identity to Imitation Switch (ISWI)/SNF2 amino acid sequence of SEQ ID NO:112 over at least 150 contiguous amino acids, where this polypeptide functions in development of the visual system or of cartilage.

[0125] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 75% sequence identity over the full length of the Chromosomal Assembly Protein C (XCAP-C) amino acid sequence of SEQ ID NO:114, where this polypeptide functions in optic tectum, eye, or hindbrain development.

[0126] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 83% sequence identity over the full length of the DNA Replication Licensing Factor MCM2 amino acid sequence of SEQ ID NO:116, where this polypeptide functions in eye, optic tectum, jaw, or cartilage development.

[0127] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 88% sequence identity to the DNA Replication Licensing Factor MCM3 amino acid sequence of SEQ ID NO:118 over at least 170 contiguous amino acids, where this polypeptide functions in optic tectum, head, or eye development.

[0128] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 55% sequence identity to the Valyl-tRNA synthase amino acid sequence of SEQ ID NO:120 over at least 450 contiguous amino acids, where this polypeptide functions in cell death regulation, head or eye development, or pigmentation.

[0129] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 98% sequence identity to the 40S Ribosomal Protein S5 amino acid sequence of SEQ ID NO:122 over at least 210 contiguous amino acids, where this polypeptide functions in brain development or motility.

[0130] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 90% sequence identity to the TCP-1 Beta amino acid sequence of SEQ ID NO:124 over at least 500 contiguous amino acids, where this polypeptide functions in jaw, head, eye, or cartilage development.

[0131] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 90% sequence identity to the TCP-1 Eta amino acid sequence of SEQ ID NO:126 over at least 500 contiguous amino acids, where this polypeptide functions in head and eye development.

[0132] In a further aspect, the invention features an isolated polypeptide including the amino acid sequence of Translation Elongation Factor eEF Alpha amino acid sequence of SEQ ID NO:128, where this polypeptide functions in cell death regulation in the head or eye.

[0133] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 55% sequence identity to the 1257 amino acid sequence of SEQ ID NO:130 over at least 350 contiguous amino acids, where this polypeptide functions in head, eye, or jaw development.

[0134] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 90% sequence identity to the 60S Ribosomal Protein L24 amino acid sequence of SEQ ID NO:132 over at least 160 contiguous amino acid, where this polypeptide functions in head or eye development.

[0135] In a further aspect, the invention features an isolated polypeptide including a amino acid sequence having at least 80% sequence identity to the Non-Muscle Adenylosuccinate. Synthase amino acid sequence of SEQ ID NO:134, where this polypeptide functions in cell death regulation, or in jaw or cartilage development.

[0136] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 90% sequence identity to the Nuclear Cap Binding Protein Subunit 2 amino acid sequence of SEQ ID NO:136, over at least 150 contiguous amino acids, where this polypeptide functions in head, eye, CNS, jaw, cartilage, or stomach development.

[0137] In a further aspect, the invention features an isolated polypeptide including the Omithine Decarboxylase amino acid sequence of SEQ ID NO:138, where this polypeptide functions in jaw or cartilage development.

[0138] In a further aspect, the invention features an isolated polypeptide including a amino acid sequence having at least 60% sequence identity to the Protein Phosphatase 1 Nuclear Targeting Subunit amino acid sequence of SEQ ID NO:140 over at least 600 contiguous amino acids, where this polypeptide functions in head, jaw, body, or gut development.

[0139] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 80% sequence identity to the Mitochondrial Inner Membrane Translocating (rTIM23) amino acid sequence of SEQ ID NO:142 over at least 175 contiguous amino acids, where this polypeptide functions in the development of the vascular system or in eye pigmentation.

[0140] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 65% sequence identity to the 1447 amino acid sequence of SEQ ID NO:144 over at least 700 contiguous amino acids, where this polypeptide functions in pancreas, tail, stomach, or limb development.

[0141] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 72% sequence identity to the ARS2 amino acid sequence of SEQ ID NO:146 over at least 900 contiguous amino acids, where this polypeptide functions in pigment, tectum, cartilage, jaw, or ear development.

[0142] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 90% sequence identity over the full length of the BAF53a amino acid sequence of SEQ ID NO:150, where this polypeptide functions in brain, body, or eye development.

[0143] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 92% sequence identity to the Histone Deacetylase amino acid sequence of SEQ ID NO:152, where this polypeptide functions in heart, eye, ear, semicircular canal, jaw, limb, or cartilage development.

[0144] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 95% sequence identity to the Fibroblast Isoform of the ADP/ATP Carrier protein amino acid sequence of SEQ ID NO:154 over at least 300 contiguous amino acids, where this polypeptide functions in lung development.

[0145] In a further aspect, the invention features an isolated polypeptide including an amino acid sequence having at least 70% sequence identity to the TAFII-55 amino acid sequence of SEQ ID NO:156, where this polypeptide functions in head, lung, or eye development.

[0146] In a further aspect, the invention features an isolated nucleic acid (i) encoding the amino acid sequence of SEQ ID NO:2, 8, 10, 16, 18, 20, 158, 22, 26, 32, 34, 36, 38, 40, 42, 48, 50, 52, 58, 60, 66, 68, 70, 72, 74, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 150, 152, 154, or 156; (ii) an isolated nucleic acid molecule having the nucleic acid sequence of SEQ ID NO:1, 7, 9, 15, 17, 19, 157, 21, 25, 31, 33, 35, 37, 39, 41, 47, 49, 51, 57, 59, 65, 67, 69, 71, 73, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 149, 151, 153, or 155; (iii) an isolated nucleic acid molecule that hybridizes under highly stringent conditions to a probe having the nucleic acid sequence of SEQ ID NO:1, 7, 9, 15, 17, 19, 157, 21, 25, 31, 33, 35, 37, 39, 41, 47 49, 51, 57, 59, 65, 67, 69, 71, 73, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 149, 151, 153, or 155 or a portion thereof; or (iv) an isolated nucleic acid molecule complementary to the isolated nucleic acid molecule of (i), (ii), or (iii). For example, the nucleic acid molecule may be a vertebrate, e.g., human, mouse, or zebrafish nucleic acid molecule. In addition, the invention also features a vector including an isolated nucleic acid molecule of the invention, for example, one operably linked to a promoter, as well as a cell including such a vector.

[0147] In a further aspect, the invention features an isolated polypeptide including the amino acid sequence of SEQ ID NO:2, 8, 10, 16, 18, 20, 158, 22, 26, 32, 34, 36, 38, 40, 42, 48, 50, 52, 58, 60, 66, 68, 70, 72, 74, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 150, 152, 154, or 156; having the amino acid sequence of SEQ ID NO:2, 8, 10, 16, 18, 20, 158, 22, 26, 32, 34, 36, 38, 40, 42, 48, 50, 52, 58, 60, 66, 68, 70, 72, 74, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 150, 152, 154, or 156; or a polypeptide that is encoded by an isolated nucleic acid molecule including the nucleic acid sequence of SEQ ID NO:1, 7, 9, 15, 17, 19, 157, 21, 25, 31, 33, 35, 37, 39, 41, 47, 49, 51, 57, 59, 65, 67, 69, 71, 73, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 149, 151, 153, or 155.

[0148] In a further aspect, the invention features a method of treating or preventing a proliferative disorder in an organism, the method including the step of contacting the organism with a therapeutically effective amount of a nucleic acid including the nucleic acid sequence of SEQ ID NO:1, 57, 59, 67, 95, 99, 119, or 127 its complement, where the nucleic acid is sufficient to elicit an alteration in expression of a 904, 1055-1, Kinesin Related Motor Protein EG5, 459, 299, Valyl-tRNA Synthase, Translation Elongation Factor eEF1 Alpha, 821-02 nucleic acid sequence in the organism, and where this alteration in the level of expression treats or prevents a proliferative disorder. In addition, this nucleic acid may be, for example, a cDNA or an mRNA molecule and the contacting may result in an increase in expression of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO:1, 57, 59, 67, 95, 99, 119, or 127. Alternatively, the nucleic acid may be a double-stranded RNA molecule or an anti-sense RNA molecule, and the contacting may lead to a decrease in expression, or may inhibit biological activity, of a nucleic acid sequence including SEQ ID NO:1, 57, 59, 67, 95, 99, 119, or 127.

[0149] In a further aspect, the invention features a method of treating or preventing a bone, connective tissue, or cartilage disorder in an organism, the method including the step of contacting the organism with a therapeutically effective amount of a nucleic acid including the nucleic acid sequence of SEQ ID NO:9, 31, 33, 35, 37, 39, 41, 67, 69, 123, 133, 135, 137, 143, or 151 or its complement, where the nucleic acid is sufficient to elicit an alteration in expression of a 954, histone deacetylase, 215, 307, 572, 1116A, 1548, Casein kinase 1α, 299, 994, TCP-1 Beta, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, Ornithine Decarboxylase, or 1447 nucleic acid sequence in the organism, and where this alteration in the level of expression treats or prevents a bone, connective tissue, or cartilage disorder. In addition, the nucleic acid may be, for example, a cDNA or an mRNA molecule and the contacting may result in an increase in expression of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO:9, 31, 33, 35, 37, 39, 41, 67, 69, 123, 133, 135, 137, 143, or 151. Alternatively, the nucleic acid may be a double-stranded RNA molecule or an anti-sense RNA molecule, and the contacting may lead to a decrease in expression, or may inhibit biological activity, of a nucleic acid sequence including SEQ ID NO:9, 31, 33, 35, 37, 39, 41, 67, 69, 123, 133, 135, 137, 143, or 151.

[0150] In a further aspect, the invention features a method of treating or preventing a cell death disorder in an organism, the method including the step of contacting the organism with a therapeutically effective amount of a nucleic acid including the nucleic acid sequence of SEQ ID NO:7, 57, 59, 67, 71, 73, 79, 81, 83, 85, 87, 91, 93, 95, 97, 101, 103, 105, 107, 111, 113, 115, 117, 119, 127, 133, 135, 137, 139, or 145 or its complement, where the nucleic acid is sufficient to elicit an alteration in expression of a U2AF, Kinesin Related Motor Protein EG5, 459, 299, 1373, Denticleless, Telomeric Repeat Factor 2, SIL, U1 Small Nuclear Ribonuclear Protein C, Ski Interacting Protein, 297, Small Nuclear Ribonucleoprotein D1, DNA Polymerase Epsilon Subunit B, 1045, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, 1581, DEAD-Box RNA Helicase (DEAD5 or DEAD19), ISWI/SNF2, XCAP-C, DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Nuclear Cap Binding Protein Subunit 2, Ornithine Decarboxylase, ARS2, Protein Phosphatase 1 Nuclear Targeting Subunit, Valyl-tRNA Synthase, Translation Elongation Factor eEF1 Alpha, Non-Muscle Adenylosuccinate Synthase, or 821-02 nucleic acid sequence in the organism, and where this alteration in the level of expression treats or prevents a cell death disorder. In addition, the nucleic acid may be, for example, a cDNA or an mRNA molecule and the contacting may result in an increase in expression of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO:7, 57, 59, 67, 71, 73, 79, 81, 83, 85, 87, 91, 93, 95, 97, 101, 103, 105, 107, 111, 113, 115, 117, 119, 127, 133, 135, 137, 139, or 145. Alternatively, the nucleic acid may be a double-stranded RNA molecule or an anti-sense RNA molecule, and the contacting may lead to a decrease in expression, or may inhibit biological activity, of a nucleic acid sequence including SEQ ID NO:7, 57, 59, 67, 71, 73, 79, 81, 83, 85, 87, 91, 93, 95, 97, 101, 103, 105, 107, 111, 113, 115, 117, 119, 127, 133, 135, 137, 139, or 145.

[0151] In a further aspect, the invention features a method of treating or preventing a circulatory disorder in an organism, the method including the step of contacting the organism with a therapeutically effective amount of a nucleic acid including the nucleic acid sequence of SEQ ID NO:1, 47, 141, or 157 or its complement, where the nucleic acid is sufficient to elicit an alteration in expression of a 904, 1463, 429, or Mitochondrial Inner Membrane Translocating (rTIM23) nucleic acid sequence in the organism, and where this alteration in the level of expression treats or prevents a circulatory disorder. In addition, the nucleic acid may be, for example, a cDNA or an mRNA molecule and the contacting may result in an increase in expression of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO:1, 47, 141, or 157. Alternatively, the nucleic acid may be a double-stranded RNA molecule, or an anti-sense RNA molecule, and the contacting may lead to a decrease in expression, or may inhibit biological activity, of a nucleic acid sequence including SEQ ID NO:1, 47, 141, or 157.

[0152] In a further aspect, the invention features a method of treating or preventing a craniofacial defect in an organism, the method including the step of contacting the organism with a therapeutically effective amount of a nucleic acid including the nucleic acid sequence of SEQ ID NO:31, 33, 35, 37, 39, 41, 67, 87, 115, 123, 129, 133, 135, or 137 or its complement, where the nucleic acid is sufficient to elicit an alteration in expression of a 215, 307, 572, 1116A, 1548, Casein Kinase 1α, 299, 297, DNA Replication Licensing Factor MCM2, 1257, TCP-1 Beta, Nuclear Cap Binding Protein Subunit 2, Non-Muscle Adenylosuccinate Synthase, or Ornithine Decarboxylase nucleic acid sequence in the organism, and where this alteration in the level of expression treats or prevents a craniofacial defect. In addition, the nucleic acid may be, for example, a cDNA or an mRNA molecule and the contacting may result in an increase in expression of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO:31, 33, 35, 37, 39, 41, 67, 87, 115, 123, 129, 133, 135, or 137. Alternatively, the nucleic acid may be a double-stranded RNA molecule or an anti-sense RNA molecule, and the contacting may lead to a decrease in expression, or may inhibit biological activity, of a nucleic acid sequence including SEQ ID NO:31, 33, 35, 37, 39, 41, 67, 87, 115, 123, 129, 133, 135, or 137.

[0153] In a further aspect, the invention features a method of treating or preventing a hearing disorder in an organism, the method including the step of contacting the organism with a therapeutically effective amount of a nucleic acid including the nucleic acid sequence of SEQ ID NO:83, 125, 145, or 151 or its complement, where the nucleic acid is sufficient to elicit an alteration in expression of a U1 Small Nuclear Ribonucleoprotein C, ARS2, TCP-1 Eta, or Histone Deacetylase nucleic acid sequence in the organism, and where this alteration in the level of expression treats or prevents a hearing disorder. In addition, the nucleic acid may be, for example, a cDNA or an mRNA molecule and the contacting may result in an increase in expression of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO:83, 125, 145, or 151. Alternatively, the nucleic acid may be a double-stranded RNA molecule or an anti-sense RNA molecule, and the contacting may lead to a decrease in expression, or may inhibit biological activity, of a nucleic acid sequence including SEQ ID NO:83, 125, 145, or 151.

[0154] In a further aspect, the invention features a method of treating or preventing diabetes in an organism, the method including the step of contacting the organism with a therapeutically effective amount of a nucleic acid including the nucleic acid sequence of SEQ ID NO:47 or 143 or its complement, where the nucleic acid is sufficient to elicit an alteration in expression of a 429 or 1447 nucleic acid sequence in the organism, and where this alteration in the level of expression treats or prevents diabetes. In addition, the nucleic acid may be, for example, a cDNA or an mRNA molecule and the contacting results in an increase in expression of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO:47 or 143. Alternatively, the nucleic acid may be a double-stranded RNA molecule or an anti-sense RNA molecule, and the contacting may lead to a decrease in expression, or may inhibit biological activity, of a nucleic acid sequence including SEQ ID NO:47 or 143.

[0155] In a further aspect, the invention features a method of treating or preventing a heart defect, disease, or disorder in an organism, the method including the step of contacting the organism with a therapeutically effective amount of a nucleic acid including the nucleic acid sequence of SEQ ID NO:39, 77, 135, 141, or 151 or its complement, where the nucleic acid is sufficient to elicit an alteration in expression of a 1548, Histone Deacetylase, or Nuclear Cap Binding Protein Subunit 2, Mitochondrial Inner Membrane Translocating Protein (rTIM23), or TCP-1 Alpha nucleic acid sequence in the organism, and where this alteration in the level of expression treats or prevents a heart defect, disease, or disorder. In addition, the nucleic acid may be, for example, a cDNA or an mRNA molecule and the contacting may result in an increase in expression of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO:25, 39, 77, 135, 141, or 151. Alternatively, the nucleic acid may be a double-stranded RNA molecule or an anti-sense RNA molecule, and the contacting may lead to a decrease in expression, or may inhibit biological activity, of a nucleic acid sequence including SEQ ID NO:25, 39, 77, 135, 141, or 151.

[0156] In a further aspect, the invention features a method of treating or preventing infertility in an organism, the method including the step of contacting the organism with a therapeutically effective amount of a nucleic acid including the nucleic acid sequence of SEQ ID NO:51 or its complement, where the nucleic acid is sufficient to elicit an alteration in expression of a Spinster nucleic acid sequence in the organism, and where this alteration in the level of expression treats or prevents infertility. In addition, the nucleic acid may be, for example, a cDNA or an mRNA molecule and the contacting may result in an increase in expression of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO:51. Alternatively, the nucleic acid may be a double-stranded RNA molecule or an anti-sense RNA molecule, and the contacting may lead to a decrease in expression, or may inhibit biological activity, of a nucleic acid sequence including SEQ ID NO:51.

[0157] In a further aspect, the invention features a method of treating or preventing a limb formation defect in an organism, the method including the step of contacting the organism with a therapeutically effective amount of a nucleic acid including the nucleic acid sequence of SEQ ID NO:41 or 151 or its complement, where the nucleic acid is sufficient to elicit an alteration in expression of a Casein Kinase 1α or Histone Deacetylase nucleic acid sequence in the organism, and where this alteration in the level of expression treats or prevents a limb formation defect. In addition, the nucleic acid may be, for example, a cDNA or an mRNA molecule and the contacting may result in an increase in expression of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO:41 or 151. Alternatively, the nucleic acid may be a double-stranded RNA molecule or an anti-sense RNA molecule, and the contacting may lead to a decrease in expression, or may inhibit biological activity, of a nucleic acid sequence including SEQ ID NO:41 or 151.

[0158] In a further aspect, the invention features a method of treating or preventing mental retardation in an organism, the method including the step of contacting the organism with a therapeutically effective amount of a nucleic acid including the nucleic acid sequence of SEQ ID NO:7, 21, 25, 67, 69, 71, 79, 81, 83, 85, 89, 91, 93, 97, 101, 103, 105, 107, 111, 113, 115, 117, 119, 121, 123, 127, 129, 131, 133, 135, 137, 139, 143, 145, 149, 151, 155, or 157 or its complement, where the nucleic acid is sufficient to elicit an alteration in expression of a U2AF, 1463, VPSP18, 60S Ribosomal Protein L35, 299, 1373, Telomeric Repeat Factor 2, SIL, U1 Small Ribonucleoprotein C, SKIP, 297, Small Nuclear Ribonucleoprotein D1, DNA Polymerase Epsilon Subunit B, 1045, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, DEAD-Box RNA Helicase (DEAD5 or DEAD19), 1581, ISWI/SNF, Chromosomal Assembly Protein (XCAP-C), DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Valy-tRNA Synthase, 40S Ribosomal. Protein S5, TCP-1 Alpha, TCP-1 Beta, Translation Elongation Factor eEF1 Alpha, 1257, 60S Ribosomal Protein L24, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, ARS2, BAF53a, Histone Deacetylase, Protein Phosphatase 1 Nuclear Targeting Subunit protein (PNUTS), Ornithine Decarboxylase, 1447, 1262, 994, or TAFII-55 nucleic acid sequence in the organism, and where this alteration in the level of expression treats or prevents mental retardation. In addition, the nucleic acid may be a cDNA or an mRNA molecule and the contacting may result in an increase in expression of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO:7, 21, 25, 67, 69, 71, 79, 81, 83, 85, 89, 91, 93, 97, 101, 103, 105, 107, 111, 113, 115, 117, 119, 121, 123, 127, 129, 131, 133, 135, 137, 139, 143, 145, 149, 151, 155, or 157. Alternatively, the nucleic acid may be a double-stranded RNA molecule or an anti-sense RNA molecule, and the contacting may lead to a decrease in expression, or may inhibit biological activity, of a nucleic acid sequence including SEQ ID NO:7, 21, 25, 67, 69, 71, 79, 81, 83, 85, 89, 91, 93, 97, 101, 103, 105, 107, 111, 113, 115, 117, 119, 121, 123, 127, 129, 131, 133, 135, 137, 139, 143, 145, 149, 151, 155, or 157.

[0159] In a further aspect, the invention features a method of treating or preventing a muscle defect in an organism, the method including the step of contacting the organism with a therapeutically effective amount of a nucleic acid including the nucleic acid sequence of SEQ ID NO:49 or 73 or its complement, where the nucleic acid is sufficient to elicit an alteration in expression of a 428 or Denticleless nucleic acid sequence in the organism, and where this alteration in the level of expression treats or prevents a muscle defect. In addition, the nucleic acid may be, for example, a cDNA or an mRNA molecule and the contacting may result in an increase in expression of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO:49 or 73. Alternatively, the nucleic acid may be a double-stranded RNA molecule or an anti-sense RNA molecule, and the contacting may lead to a decrease in expression, or may inhibit biological activity, of a nucleic acid sequence including SEQ ID NO:49 or 73.

[0160] In a further aspect, the invention features a method of treating or preventing a neurodegenerative disorder in an organism, the method including the step of contacting the organism with a therapeutically effective amount of a nucleic acid including the nucleic acid sequence of SEQ ID NO:19, 57, 59, 65, 67, 87, 95, 119, 121, or 127 or its complement, where the nucleic acid is sufficient to elicit an alteration in expression of a Vesicular Integral Membrane Protein VIP 36, 297, 40S Ribosomal Protein S5, V-ATPase 16 kDa Proteolytic Subunit, Kinesin Related Motor Protein EG5, 459, 299, 821-02, Valyl-tRNA Synthase, or Translation Elongation Factor eEF1 Alpha nucleic acid sequence in the organism, and where this alteration in the level of expression treats or prevents a neurodegenerative disorder. In addition, the nucleic acid may be, for example, a cDNA or an mRNA molecule and the contacting may result in an increase in expression of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO:19, 57, 65, 67, 69, 87, 95, 119, 121, or 127. Alternatively, the nucleic acid may be a double-stranded RNA molecule or an anti-sense RNA molecule, and the contacting may lead to a decrease in expression, or may inhibit biological activity, of a nucleic acid sequence including SEQ ID NO:19, 57, 59, 65, 67, 87, 95, 119, 121, or 127.

[0161] In a further aspect, the invention features a method of treating or preventing stroke in an organism, the method including the step of contacting the organism with a therapeutically effective amount of a nucleic acid including the nucleic acid sequence of SEQ ID NO:1, 141, or 157 or its complement, where the nucleic acid is sufficient to elicit an alteration in expression of a 904, Mitochondrial Inner Membrane Translocating (rTIM23), or 1463 nucleic acid sequence in the organism, and where this alteration in the level of expression treats or prevents stroke. In addition, the nucleic acid may be, for example, a cDNA or an mRNA molecule and the contacting may result in an increase in expression of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO:1, 141, or 157. Alternatively, the nucleic acid may be a double-stranded RNA molecule or an anti-sense RNA molecule, and the contacting may lead to a decrease in expression, or may inhibit biological activity, of a nucleic acid sequence including SEQ ID NO:1, 141, or 157.

[0162] In a further aspect, the invention features a method of treating or preventing a stem cell regeneration disorder in an organism, the method including the step of contacting the organism with a therapeutically effective amount of a nucleic acid including the nucleic acid sequence of SEQ ID NO:1, 141, or 143 or its complement, where the nucleic acid is sufficient to elicit an alteration in expression of a 904, Mitochondrial Inner Membrane Translocating (rTIM23), or 1447 nucleic acid sequence in the organism, and where this alteration in the level of expression treats or prevents a stem cell regeneration disorder. In addition, the nucleic acid may be, for example, a cDNA or an mRNA molecule and the contacting may result in an increase in expression of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO:1, 141, or 143. Alternatively, the nucleic acid may be a double-stranded RNA molecule or an anti-sense RNA molecule, and the contacting may lead to a decrease in expression, or may inhibit biological activity, of a nucleic acid sequence including SEQ ID NO:1, 141, or 143.

[0163] In a further aspect, the invention features a method of treating or preventing a visual defect in an organism, the method including the step of contacting the organism with a therapeutically effective amount of a nucleic acid including the nucleic acid sequence of SEQ ID NO:15, 17, 19, 21, 55, 79, 81, 87, 111, or 157 or its complement, where the nucleic acid is sufficient to elicit an alteration in expression of a V-ATPase SFD subunit, V-ATPase 16 kDa Proteolytic Subunit, V-ATPase Alpha Subunit, 1463, VPSP8, Ribonucleotide Reductase R1 Class 1, Telomeric Repeat Factor 2, SIL, ISWI/SNF2 or 297 nucleic acid sequence in the organism, and where this alteration in the level of expression treats or prevents a visual defect. In addition, the nucleic acid may be, for example, a cDNA or an mRNA molecule and the contacting may result in an increase in expression of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO:15, 17, 19, 21, 55, 79, 81, 87, 111, or 157. Alternatively, the nucleic acid may be a double-stranded RNA molecule or an anti-sense RNA molecule, and the contacting may lead to a decrease in expression, or may inhibit biological activity, of a nucleic acid sequence including SEQ ID NO:15, 17, 19, 21, 55, 79, 81, 87, 111, or 157.

[0164] In a further aspect, the invention features a method of treating or preventing a pulmonary disorder in an organism, the method including the step of contacting the organism with a therapeutically effective amount of a nucleic acid including the nucleic acid sequence of SEQ ID NO:153 or 155 or its complement, where the nucleic acid is sufficient to elicit an alteration in expression of an Fibroblast Isoform of the ADP/ATP Carrier Protein or TAFII-55 nucleic acid sequence in the organism, and where this alteration in the level of expression treats or prevents a pulmonary disorder. In addition, the nucleic acid may be, for example, a cDNA or an mRNA molecule and the contacting may result in an increase in expression of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO:153 or 155. Alternatively, the nucleic acid may be a double-stranded RNA or an anti-sense RNA molecule, and the contacting may lead to a decrease in expression, or may inhibit biological activity, of a nucleic acid sequence including SEQ ID NO:153 or 155.

[0165] In a further aspect, the invention features a method of treating or preventing a movement disorder in an organism, the method including the step of contacting the organism with a therapeutically effective amount of a nucleic acid including the nucleic acid sequence of SEQ ID NO:11, 13, 63, 81, 83, 85, 121 or its complement, where the nucleic acid is sufficient to elicit an alteration in expression of a Cad-1, Aryl Hydrocarbon Receptor Nuclear Translocator 2A, or 40S Ribosomal Protein S5, Neurogenin Related Protein 1, SIL, U1 Small Nuclear Ribonucleoprotein C, Ski Interacting Protein nucleic acid sequence in the organism, and where this alteration in the level of expression treats or prevents a movement disorder. In addition, the nucleic acid may be, for example, a cDNA or an mRNA molecule and the contacting may result in an increase in expression of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO:11, 13, 63, 81, 83, 85, or 121. Alternatively, the nucleic acid may be a double-stranded RNA molecule or an anti-sense RNA molecule, and the contacting may lead to a decrease in expression, or may inhibit biological activity, of a nucleic acid sequence including SEQ ID NO:11, 13, 63, 81, 83, 85, or 121.

[0166] In a further aspect, the invention features a method of treating or preventing a somite formation disorder in an organism, the method including the step of contacting the organism with a therapeutically effective amount of a nucleic acid including the nucleic acid sequence of SEQ ID NO:49 or 73 or its complement, where the nucleic acid is sufficient to elicit an alteration in expression of a 428 or Denticleless nucleic acid sequence in the organism, and where this alteration in the level of expression treats or prevents a somite formation disorder. In addition, the nucleic acid may be, for example, a cDNA or an mRNA molecule and the contacting may result in an increase in expression of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO:49 or 73. Alternatively, the nucleic acid may be a double-stranded RNA molecule or an anti-sense RNA molecule, and the contacting may lead to a decrease in expression, or may inhibit biological activity, of a nucleic acid sequence including SEQ ID NO:49 or 73.

[0167] In a further aspect, the invention features a method for diagnosing a proliferative disorder or the propensity to develop a proliferative disorder in an organism. This method includes detecting an alteration in the level of 904, 1055-1, Kinesin Related Motor Protein EG5, 459, 299, Valyl-tRNA Synthase, Translation Elongation Factor eEF1 Alpha, 821-02 polypeptide expression in a sample derived from a first organism and comparing the level of expression to that of a 904, 1055-1, Kinesin Related Motor Protein EG5, 459, 299, Valyl-tRNA Synthase, Translation Elongation Factor eEF1 Alpha, 821-02 polypeptide in a sample derived from a second, control organism, where an alteration in the level of expression or activity of the 904, 1055-1, Kinesin Related Motor Protein EG5, 459, 299, Valyl-tRNA Synthase, Translation Elongation Factor eEF1 Alpha, 821-02 polypeptide in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a proliferative disorder. In a further aspect, the invention features a method for diagnosing a proliferative disorder or the propensity to develop a proliferative disorder in an organism. This method includes detecting an alteration in the sequence, or a portion of the sequence, of a 904, 1055-1, Kinesin Related Motor Protein EG5, 459, 299, Valyl-tRNA Synthase, Translation Elongation Factor eEF1 Alpha, 821-02 nucleic acid molecule in a sample derived from a first organism and comparing the sequence to that of a 904, 1055-1, Kinesin Related Motor Protein EG5, 459, 299, Valyl-tRNA Synthase, Translation Elongation Factor eEF1 Alpha, 821-02 nucleic acid molecule derived from a second, control organism, where an alteration of the 904, 1055-1, Kinesin Related Motor Protein EG5, 459, 299, Valyl-tRNA Synthase, Translation Elongation Factor eEF1 Alpha, 821-02 sequence in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a proliferative disorder. In the alternative, the alteration may be an increase or a decrease in the expression or activity of a nucleic acid molecule including the sequence of SEQ ID NO:1, 57, 59, 67, 95, 99, 119, or 127.

[0168] In a further aspect, the invention features a method for identifying a candidate compound for the treatment of a proliferative disorder. This method includes the steps of: (a) contacting a polypeptide including the amino acid sequence of SEQ ID NO:2, 58, 60, 68, 96, 100, 120, or 128 with a candidate compound, and (b) detecting an interaction of the polypeptide with the candidate compound, where a candidate compound that binds the polypeptide may be used to treat a proliferative disorder.

[0169] In a further aspect, the invention features a method for diagnosing a bone, connective tissue, or cartilage formation disorder or the propensity to develop a bone, connective tissue, or cartilage formation disorder in an organism. This method includes detecting an alteration in the level of 954, histone deacetylase, 215, 307, 572, 1116A, 1548, Casein kinase 1α, 299, 994, TCP-1 Beta, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, Ornithine Decarboxylase, or 1447 polypeptide expression in a sample derived from a first organism and comparing the level of expression to that of a 954, histone deacetylase, 215, 307, 572, 1116A, 1548, Casein kinase 1α, 299, 994, TCP-1 Beta, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, Ornithine Decarboxylase, or 1447 polypeptide in a sample derived from a second, control organism, where an alteration in the level of expression or activity of 954, histone deacetylase, 215, 307, 572, 1116A, 1548, Casein kinase 1α, 299, 994, TCP-1 Beta, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, Omithine Decarboxylase, or 1447 polypeptide in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a bone, connective tissue, or cartilage formation disorder.

[0170] In a further aspect, the invention features a method for diagnosing a bone, connective tissue, or cartilage formation disorder or the propensity to develop a bone, connective tissue, or cartilage formation disorder in an organism. This method includes detecting an alteration in the sequence, or a portion of the sequence, of a 954, histone deacetylase, 215, 307, 572, 1116A, 1548, Casein kinase 1α, 299, 994, TCP-1 Beta, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, Ornithine Decarboxylase, or 1447 nucleic acid molecule in a sample derived from a first organism and comparing the sequence to that of a 954, histone deacetylase, 215, 307, 572, 1116A, 1548, Casein kinase 1α, 299, 994, TCP-1 Beta, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, Ornithine Decarboxylase, or 1447 nucleic acid molecule derived from a second, control organism, where an alteration of the 954, histone deacetylase, 215, 307, 572, 1116A, 1548, Casein kinase 1α, 299, 994, TCP-1 Beta, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, Ornithine Decarboxylase, or 1447 sequence in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a bone, connective tissue, or cartilage formation disorder. In the alternative, the alteration may be an increase or a decrease in the expression or activity of a nucleic acid molecule including the sequence of SEQ ID NO:9, 31, 33, 35, 37, 39, 41, 67, 69, 123, 133, 135, 137, 143, or 151.

[0171] In a further aspect, the invention features a method for identifying a candidate compound for the treatment of a bone, connective tissue, or cartilage formation disorder. This method includes the steps of: (a) contacting a polypeptide including the amino acid sequence of SEQ ID NO:10, 32, 34, 36, 38, 40, 42, 68, 70, or 152 with a candidate compound, and (b) detecting an interaction of the polypeptide with the candidate compound, where a candidate compound that binds the polypeptide may be used to treat a bone, connective tissue, or cartilage formation disorder.

[0172] In a further aspect, the invention features a method for diagnosing a cell death disorder or the propensity to develop a cell death disorder in an organism. This method includes detecting an alteration in the level of U2AF, Kinesin Related Motor Protein EG5, 459, 299, 1373, Denticleless, Telomeric Repeat Factor 2, SIL, U1 Small Nuclear Ribonuclear Protein C, Ski Interacting Protein, 297, Small Nuclear Ribonucleoprotein D1, DNA Polymerase Epsilon Subunit B, 1045, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, 1581, DEAD-Box RNA Helicase (DEAD5 or DEAD19), ISWI/SNF2, XCAP-C, DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Nuclear Cap Binding Protein Subunit 2, Ornithine Decarboxylase, ARS2, Protein Phosphatase 1 Nuclear Targeting Subunit, Valyl-tRNA Synthase, Translation Elongation Factor eEF1 Alpha, Non-Muscle Adenylosuccinate Synthase, or 821-02 polypeptide expression in a sample derived from a first organism and comparing the level of expression to that of a U2AF, Kinesin Related Motor Protein EG5, 459, 299, 1373, Denticleless, Telomeric Repeat Factor 2, SIL, U1 Small Nuclear Ribonuclear Protein C, Ski Interacting Protein, 297, Small Nuclear Ribonucleoprotein D1, DNA Polymerase Epsilon Subunit B, 1045, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, 1581, DEAD-Box RNA Helicase (DEAD5 or DEAD19), ISWI/SNF2, XCAP-C, DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Nuclear Cap Binding Protein Subunit 2, Omithine Decarboxylase, ARS2, Protein Phosphatase 1 Nuclear Targeting Subunit, Valyl-tRNA Synthase, Translation Elongation Factor eEF1 Alpha, Non-Muscle Adenylosuccinate Synthase, or 821-02 polypeptide in a sample derived from a second, control organism, where an alteration in the level of expression or activity of the U2AF, Kinesin Related Motor Protein EG5, 459, 299, 1373, Denticleless, Telomeric Repeat Factor 2, SIL, U1 Small Nuclear Ribonuclear Protein C, Ski Interacting Protein, 297, Small Nuclear Ribonucleoprotein D1, DNA Polymerase Epsilon Subunit B, 1045, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, 1581, DEAD-Box RNA Helicase (DEAD5 or DEAD19), ISWI/SNF2, XCAP-C, DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Nuclear Cap Binding Protein Subunit 2, Omithine Decarboxylase, ARS2, Protein Phosphatase 1 Nuclear Targeting Subunit, Valyl-tRNA Synthase, Translation Elongation Factor eEF1 Alpha, Non-Muscle Adenylosuccinate Synthase, or 821-02 polypeptide in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a cell death disorder.

[0173] In a further aspect, the invention features a method for diagnosing a cell death disorder or the propensity to develop a cell death disorder in an organism. This method includes detecting an alteration in the sequence, or a portion of the sequence, of a U2AF, Kinesin Related Motor Protein EG5, 459, 299, 1373, Denticleless, Telomeric Repeat Factor 2, SIL, U1 Small Nuclear Ribonuclear Protein C, Ski Interacting Protein, 297, Small Nuclear Ribonucleoprotein D1, DNA Polymerase Epsilon Subunit B, 1045, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, 1581, DEAD-Box RNA Helicase (DEAD5 or DEAD19), ISWI/SNF2, XCAP-C, DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Nuclear Cap Binding Protein Subunit 2, Ornithine Decarboxylase, ARS2, Protein Phosphatase 1 Nuclear Targeting Subunit, Valyl-tRNA Synthase, Translation Elongation Factor eEF1 Alpha, Non-Muscle Adenylosuccinate Synthase, or 821-02 nucleic acid molecule in a sample derived from a first organism and comparing the sequence to that of a U2AF, Kinesin Related Motor Protein EG5, 459, 299, 1373, Denticleless, Telomeric Repeat Factor 2, SIL, U1 Small Nuclear Ribonuclear Protein C, Ski Interacting Protein, 297, Small Nuclear Ribonucleoprotein D1, DNA Polymerase Epsilon Subunit B, 1045, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, 1581, DEAD-Box RNA Helicase (DEAD5 or DEAD19), ISWI/SNF2, XCAP-C, DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Nuclear Cap Binding Protein Subunit 2, Ornithine Decarboxylase, ARS2, Protein Phosphatase 1 Nuclear Targeting Subunit, Valyl-tRNA Synthase, Translation Elongation Factor eEF1 Alpha, Non-Muscle Adenylosuccinate Synthase, or 821-02 nucleic acid molecule derived from a second, control organism, where an alteration of the U2AF, Kinesin Related Motor Protein EG5, 459, 299, 1373, Denticleless, Telomeric Repeat Factor 2, SIL, U1 Small Nuclear Ribonuclear Protein C, Ski Interacting Protein, 297, Small Nuclear Ribonucleoprotein D1, DNA Polymerase Epsilon Subunit B, 1045, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, 1581, DEAD-Box RNA Helicase (DEAD5 or DEAD19), ISWI/SNF2, XCAP-C, DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Nuclear Cap Binding Protein Subunit 2, Ornithine Decarboxylase, ARS2, Protein Phosphatase 1 Nuclear Targeting Subunit, Valyl-tRNA Synthase, Translation Elongation Factor eEF1 Alpha, Non-Muscle Adenylosuccinate Synthase, or 821-02 sequence in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a cell death disorder. In the alternative, the alteration may be an increase or a decrease in the expression or activity of a nucleic acid molecule including the sequence of SEQ ID NO:7, 57, 59, 67, 71, 73, 79, 81, 83, 85, 87, 91, 93, 95, 97, 101, 103, 105, 107, 111, 113, 115, 117, 119, 127, 133, 135, 137, 139, or 145.

[0174] In a further aspect, the invention features a method for identifying a candidate compound for the treatment of a cell death disorder. This method includes the steps of: (a) contacting a polypeptide including the amino acid sequence of SEQ ID NO:8, 58, 60, 68, 72, 74, 80, 82, 84, 86, 88, 92, 94, 98, 102, 104, 106, 108, 112, 114, 116, 118, 120, 128, 134, 136, 138, 140, or 146 with a candidate compound, and (b) detecting an interaction of the polypeptide with the candidate compound, where a candidate compound that binds the polypeptide may be used to treat a cell death disorder.

[0175] In a further aspect, the invention features a method for diagnosing a circulatory disorder or the propensity to develop a circulatory disorder in an organism. This method includes detecting an alteration in the level of 904, 1463, 429, or Mitochondrial Inner Membrane Translocating (rTIM23) polypeptide expression in a sample derived from a first organism and comparing the level of expression to that of a 904, 1463, 429, or Mitochondrial Inner Membrane Translocating (rTIM23) polypeptide in a sample derived from a second, control organism, where an alteration in the level of expression or activity of the 904, 1463, 429, or Mitochondrial Inner Membrane Translocating (rTIM23) polypeptide in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a circulatory disorder.

[0176] In a further aspect, the invention features a method for diagnosing a circulatory disorder or the propensity to develop a circulatory disorder in an organism. This method includes detecting an alteration in the sequence, or a portion of the sequence, of a 904, 1463, 429, or Mitochondrial Inner Membrane Translocating (rTIM23) nucleic acid molecule in a sample derived from a first organism and comparing the sequence to that of a 904, 1463, 429, or Mitochondrial Inner Membrane Translocating (rTIM23) nucleic acid molecule derived from a second, control organism, where an alteration of the 904, 1463, 429, or Mitochondrial Inner Membrane Translocating (rTIM23) sequence in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a circulatory disorder. In the alternative, the alteration may be an increase or a decrease in the expression or activity of a nucleic acid molecule including the sequence of SEQ ID NO:1, 47, 141, or 157.

[0177] In a further aspect, the invention features a method for identifying a candidate compound for the treatment of a circulatory disorder. This method includes the steps of: (a) contacting a polypeptide including the amino acid sequence of SEQ ID NO:2, 48, 142, or 158 with a candidate compound, and (b) detecting an interaction of the polypeptide with the candidate compound, where a candidate compound that binds the polypeptide may be used to treat a circulatory disorder.

[0178] In a further aspect, the invention features a method for diagnosing a craniofacial defect or the propensity to develop a craniofacial defect in an organism. This method includes detecting an alteration in the level of 215, 307, 572, 1116A, 1548, Casein Kinase 1α, 299, 297, DNA Replication Licensing Factor MCM2, 1257, TCP-1 Beta, Nuclear Cap Binding Protein Subunit 2, Non-Muscle Adenylosuccinate Synthase, or Omithine Decarboxylase polypeptide expression in a sample derived from a first organism and comparing the level of expression to that of a 215, 307, 572, 1116A, 1548, Casein Kinase 1α, 299, 297, DNA Replication Licensing Factor MCM2, 1257, TCP-1 Beta, Nuclear Cap Binding Protein Subunit 2, Non-Muscle Adenylosuccinate Synthase, or Ornithine Decarboxylase polypeptide in a sample derived from a second, control organism, where an alteration in the level of expression or activity of the 215, 307, 572, 1116A, 1548, Casein Kinase 1α, 299, 297, DNA Replication Licensing Factor MCM2, 1257, TCP-1 Beta, Nuclear Cap Binding Protein Subunit 2, Non-Muscle Adenylosuccinate Synthase, or Ornithine Decarboxylase polypeptide in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a craniofacial defect.

[0179] In a further aspect, the invention features a method for diagnosing a craniofacial defect or the propensity to develop a craniofacial defect in an organism. This method includes detecting an alteration in the sequence, or a portion of the sequence, of a 215, 307, 572, 1116A, 1548, Casein Kinase 1α, 299, 297, DNA Replication Licensing Factor MCM2, 1257, TCP-1 Beta, Nuclear Cap Binding Protein Subunit 2, Non-Muscle Adenylosuccinate Synthase, or Ornithine Decarboxylase nucleic acid molecule in a sample derived from a first organism and comparing the sequence to that of a 215, 307, 572, 1116A, 1548, Casein Kinase 1α, 299, 297, DNA Replication Licensing Factor MCM2, 1257, TCP-1 Beta, Nuclear Cap Binding Protein Subunit 2, Non-Muscle Adenylosuccinate Synthase, or Ornithine Decarboxylase nucleic acid molecule derived from a second, control organism, where an alteration of the 215, 307, 572, 1116A, 1548, Casein Kinase 1α, 299, 297, DNA Replication Licensing Factor MCM2, 1257, TCP-1 Beta, Nuclear Cap Binding Protein Subunit 2, Non-Muscle Adenylosuccinate Synthase, or Ornithine Decarboxylase sequence in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a craniofacial defect. In the alternative, the alteration may be an increase or a decrease in the expression or activity of a nucleic acid molecule including the sequence of SEQ ID NO:31, 33, 35, 37, 39, 41, 67, 87, 115, 123, 129, 133, 135, or 137.

[0180] In a further aspect, the invention features a method for identifying a candidate compound for the treatment of a craniofacial defect. This method includes the steps of: (a) contacting a polypeptide including the amino acid sequence of SEQ ID NO:32, 34, 36, 38, 40, 42, 68, 88, 116, 124, 130, 134, 136, or 138 with a candidate compound, and (b) detecting an interaction of the polypeptide with the candidate compound, where a candidate compound that binds the polypeptide may be used to treat a craniofacial defect.

[0181] In a further aspect, the invention features a method for diagnosing a hearing disorder or the propensity to develop a hearing disorder in an organism. This method includes detecting an alteration in the level of U1 Small Nuclear Ribonucleoprotein C, ARS2, TCP-1 Eta, or Histone Deacetylase polypeptide expression in a sample derived from a first organism and comparing the level of expression to that of a U1 Small Nuclear Ribonucleoprotein C, ARS2, TCP-1 Eta, or Histone Deacetylase polypeptide in a sample derived from a second, control organism, where an alteration in the level of expression or activity of the U1 Small Nuclear Ribonucleoprotein C, ARS2, or TCP-1 Eta, Histone Deacetylase polypeptide in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a hearing disorder.

[0182] In a further aspect, the invention features a method for diagnosing a hearing disorder or the propensity to develop a hearing disorder in an organism. This method includes detecting an alteration in the sequence, or a portion of the sequence, of a U1 Small Nuclear Ribonucleoprotein C, ARS2, TCP-1 Eta, or Histone Deacetylase nucleic acid molecule in a sample derived from a first organism and comparing the sequence to that of a U1 Small Nuclear Ribonucleoprotein C, ARS2, TCP-1 Eta, or Histone Deacetylase nucleic acid molecule derived from a second, control organism, where an alteration of the U1 Small Nuclear Ribonucleoprotein C, ARS2, TCP-1 Eta, or Histone Deacetylase sequence in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a hearing disorder. In the alternative, the alteration may be an increase or a decrease in the expression or activity of a nucleic acid molecule including the sequence of SEQ ID NO:83, 125, 145, or 151.

[0183] In a further aspect, the invention features a method for identifying a candidate compound for the treatment of a hearing disorder. This method includes the steps of: (a) contacting a polypeptide including the amino acid sequence of SEQ ID NO:84, 125, 146, or 152 with a candidate compound, and (b) detecting an interaction of the polypeptide with the candidate compound, where a candidate compound that binds the polypeptide may be used to treat a hearing disorder.

[0184] In a further aspect, the invention features a method for diagnosing diabetes or the propensity to develop diabetes in an organism. This method includes detecting an alteration in the level of 429 or 1447 polypeptide expression in a sample derived from a first organism and comparing the level of expression to that of a 429 or 1447 polypeptide in a sample derived from a second, control organism, where an alteration in the level of expression or activity of the 429 or 1447 polypeptide in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop diabetes.

[0185] In a further aspect, the invention features a method for diagnosing diabetes or the propensity to develop diabetes in an organism. This method includes detecting an alteration in the sequence, or a portion of the sequence, of a 429 or 1447 nucleic acid molecule in a sample derived from a first organism and comparing the sequence to that of a 429 or 1447 nucleic acid molecule derived from a second, control organism, where an alteration of the 429 or 1447 sequence in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop diabetes. In the alternative, the alteration may be an increase or a decrease in the expression or activity of a nucleic acid molecule including the sequence of SEQ ID NO:47 or 143.

[0186] In a further aspect, the invention features a method for identifying a candidate compound for the treatment of diabetes. This method includes the steps of: (a) contacting a polypeptide including the amino acid sequence of SEQ ID NO:48 or 144 with a candidate compound, and (b) detecting an interaction of the polypeptide with the candidate compound, where a candidate compound that binds the polypeptide may be used to treat diabetes.

[0187] In a further aspect, the invention features a method for diagnosing heart defect or the propensity to develop a heart defect in an organism. This method includes detecting an alteration in the level of 1548, Histone Deacetylase, Nuclear Cap Binding Protein Subunit 2, Mitochondrial Inner Membrane Translocating Protein (rTIM23), or TCP-1 Alpha polypeptide expression in a sample derived from a first organism and comparing the level of expression to that of a 1548, Histone Deacetylase, Nuclear Cap Binding Protein Subunit 2, Mitochondrial Inner Membrane Translocating Protein (rTIM23), or TCP-1 Alpha polypeptide in a sample derived from a second, control organism, where an alteration in the level of expression or activity of the 1548, Histone Deacetylase, Nuclear Cap Binding Protein Subunit 2, Mitochondrial Inner Membrane Translocating Protein (rTIM23), or TCP-1 Alpha polypeptide in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a heart defect.

[0188] In a further aspect, the invention features a method for diagnosing heart defect or the propensity to develop heart defect in an organism. This method includes detecting an alteration in the sequence, or a portion of the sequence, of a 1548, Histone Deacetylase, Nuclear Cap Binding Protein Subunit 2, Mitochondrial Inner Membrane Translocating Protein (rTIM23), or TCP-1 Alpha nucleic acid molecule in a sample derived from a first organism and comparing the sequence to that of a 1548, Histone Deacetylase, Nuclear Cap Binding Protein Subunit 2, Mitochondrial Inner Membrane Translocating Protein (rTIM23), or TCP-1 Alpha nucleic acid molecule derived from a second, control organism, where an alteration of the 1548, Histone Deacetylase, Nuclear Cap Binding Protein Subunit 2, Mitochondrial Inner Membrane Translocating Protein (rTIM23), or TCP-1 Alpha sequence in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a heart defect. In the alternative, the alteration may be an increase or a decrease in the expression or activity of a nucleic acid molecule including the sequence of SEQ ID NO:39, 77, 135, 141, or 151.

[0189] In a further aspect, the invention features a method for identifying a candidate compound for the treatment of a heart defect. This method includes the steps of: (a) contacting a polypeptide including the amino acid sequence of SEQ ID NO:40, 136, or 152 with a candidate compound, and (b) detecting an interaction of the polypeptide with the candidate compound, where a candidate compound that binds the polypeptide may be used to treat a heart defect.

[0190] In a further aspect, the invention features a method for diagnosing infertility or the propensity to develop infertility in an organism. This method includes detecting an alteration in the level of Spinster polypeptide expression in a sample derived from a first organism and comparing the level of expression to that of a Spinster polypeptide in a sample derived from a second, control organism, where an alteration in the level of expression or activity of the Spinster polypeptide in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop infertility.

[0191] In a further aspect, the invention features a method for diagnosing infertility or the propensity to develop infertility in an organism. This method includes detecting an alteration in the sequence, or a portion of the sequence, of a Spinster nucleic acid molecule in a sample derived from a first organism and comparing the sequence to that of a Spinster nucleic acid molecule derived from a second, control organism, where an alteration of the Spinster sequence in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop infertility. In the alternative, the alteration may be an increase or a decrease in the expression or activity of a nucleic acid molecule including the sequence of SEQ ID NO:51.

[0192] In a further aspect, the invention features a method for identifying a candidate compound for the treatment of infertility. This method includes the steps of: (a) contacting a polypeptide including the amino acid sequence of SEQ ID NO:52 with a candidate compound, and (b) detecting an interaction of the polypeptide with the candidate compound, where a candidate compound that binds the polypeptide may be used to treat infertility.

[0193] In a further aspect, the invention features a method for diagnosing a limb formation defect or the propensity to develop a limb formation defect in an organism. This method includes detecting an alteration in the level of Casein Kinase 1α or Histone Deacetylase polypeptide expression in a sample derived from a first organism and comparing the level of expression to that of a Casein Kinase 1α or Histone Deacetylase polypeptide in a sample derived from a second, control organism, where an alteration in the level of expression or activity of the Casein Kinase 1α or Histone Deacetylase polypeptide in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a limb formation defect.

[0194] In a further aspect, the invention features a method for diagnosing a limb formation defect or the propensity to develop a limb formation defect in an organism. This method includes detecting an alteration in the sequence, or a portion of the sequence, of a Casein Kinase 1α or Histone Deacetylase nucleic acid molecule in a sample derived from a first organism and comparing the sequence to that of a Casein Kinase 1α or Histone Deacetylase nucleic acid molecule derived from a second, control organism, where an alteration of the Casein Kinase 1α or Histone Deacetylase sequence in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a limb formation defect. In the alternative, the alteration may be an increase or a decrease in the expression or activity of a nucleic acid molecule including the sequence of SEQ ID NO:41 or 151.

[0195] In a further aspect, the invention features a method for identifying a candidate compound for the treatment of a limb formation defect. This method includes the steps of: (a) contacting a polypeptide including the amino acid sequence of SEQ ID NO:42 or 152 with a candidate compound, and (b) detecting an interaction of the polypeptide with the candidate compound, where a candidate compound that binds the polypeptide may be used to treat a limb formation defect.

[0196] In a further aspect, the invention features a method for diagnosing mental retardation or the propensity to develop mental retardation in an organism. This method includes detecting an alteration in the level of U2AF, 1463, VPSP18, 60S Ribosomal Protein L35, 299, 1373, Telomeric Repeat Factor 2, SIL, U1 Small Ribonucleoprotein C, SKIP, 297, Small Nuclear Ribonucleoprotein D1, DNA Polymerase Epsilon Subunit B, 1045, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, DEAD-Box RNA Helicase (DEAD5 or DEAD19), 1581, ISWI/SNF, Chromosomal Assembly Protein (XCAP-C), DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Valyl-tRNA Synthase, 40S Ribosomal Protein S5, TCP-1 Alpha, TCP-1 Beta, Translation Elongation Factor eEF1 Alpha, 1257, 60S Ribosomal Protein L24, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, ARS2, BAF53a, Histone Deacetylase, Protein Phosphatase 1 Nuclear Targeting Subunit protein (PNUTS), Omithine Decarboxylase, 1447, 1262, 994, or TAFII-55 polypeptide expression in a sample derived from a first organism and comparing the level of expression to that of a U2AF, 1463, VPSP18, 60S Ribosomal Protein L35, 299, 1373, Telomeric Repeat Factor 2, SIL, U1 Small Ribonucleoprotein C, SKIP, 297, Small Nuclear Ribonucleoprotein D1, DNA Polymerase Epsilon Subunit B, 1045, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, DEAD-Box RNA Helicase (DEAD5 or DEAD19), 1581, ISWI/SNF, Chromosomal Assembly Protein (XCAP-C), DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Valyl-tRNA Synthase, 40S Ribosomal Protein S5, TCP-1 Alpha, TCP-1 Beta, Translation Elongation Factor eEF1 Alpha, 1257, 60S Ribosomal Protein L24, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, ARS2, BAF53a, Histone Deacetylase, Protein Phosphatase 1 Nuclear Targeting Subunit protein (PNUTS), Omithine Decarboxylase, 1447, 1262, 994, or TAFII-55 polypeptide in a sample derived from a second, control organism, where an alteration in the level of expression or activity of the U2AF, 1463, VPSP18, 60S Ribosomal Protein L35, 299, 1373, Telomeric Repeat Factor 2, SIL, U1 Small Ribonucleoprotein C, SKIP, 297, Small Nuclear Ribonucleoprotein D1, DNA Polymerase Epsilon Subunit B, 1045, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, DEAD-Box RNA Helicase (DEAD5 or DEAD19), 1581, ISWI/SNF, Chromosomal Assembly Protein (XCAP-C), DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Valyl-tRNA Synthase, 40S Ribosomal Protein S5, TCP-1 Alpha, TCP-1 Beta, Translation Elongation Factor eEF1 Alpha, 1257, 60S Ribosomal Protein L24, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, ARS2, BAF53a, Histone Deacetylase, Protein Phosphatase 1 Nuclear Targeting Subunit protein (PNUTS), Omithine Decarboxylase, 1447, 1262, 994, or TAFII-55 polypeptide in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop mental retardation.

[0197] In a further aspect, the invention features a method for diagnosing mental retardation or the propensity to develop mental retardation in an organism. This method includes detecting an alteration in the sequence, or a portion of the sequence, of a U2AF, 1463, VPSP18, 60S Ribosomal Protein L35, 299, 1373, Telomeric Repeat Factor 2, SIL, U1 Small Ribonucleoprotein C, SKIP, 297, Small Nuclear Ribonucleoprotein D1, DNA Polymerase Epsilon Subunit B, 1045, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, DEAD-Box RNA Helicase (DEAD5 or DEAD19), 1581, ISWI/SNF, Chromosomal Assembly Protein (XCAP-C), DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Valyl-tRNA Synthase, 40S Ribosomal Protein S5, TCP-1 Alpha, TCP-1 Beta, Translation Elongation Factor eEF1 Alpha, 1257, 60S Ribosomal Protein L24, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, ARS2, BAF53a, Histone Deacetylase, Protein Phosphatase 1 Nuclear Targeting Subunit protein (PNUTS), Ornithine Decarboxylase, 1447, 1262, 994, or TAFII-55 nucleic acid molecule in a sample derived from a first organism and comparing the sequence to that of a U2AF, 1463, VPSP18, 60S Ribosomal Protein L35, 299, 1373, Telomeric Repeat Factor 2, SIL, U1 Small Ribonucleoprotein C, SKIP, 297, Small Nuclear Ribonucleoprotein D1, DNA Polymerase Epsilon Subunit B, 1045, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, DEAD-Box RNA Helicase (DEAD5 or DEAD19), 1581, ISWI/SNF, Chromosomal Assembly Protein (XCAP-C), DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Valyl-tRNA Synthase, 40S Ribosomal Protein S5, TCP-1 Alpha, TCP-1 Beta, Translation Elongation Factor eEF1 Alpha, 1257, 60S Ribosomal Protein L24, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, ARS2, BAF53a, Histone Deacetylase, Protein Phosphatase 1 Nuclear Targeting Subunit protein (PNUTS), Ornithine Decarboxylase, 1447, 1262, 994, or TAFII-55 nucleic acid molecule derived from a second, control organism, where an alteration of the U2AF, 1463, VPSP18, 60S Ribosomal Protein L35, 299, 1373, Telomeric Repeat Factor 2, SIL, U1 Small Ribonucleoprotein C, SKIP, 297, Small Nuclear Ribonucleoprotein D1, DNA Polymerase Epsilon Subunit B, 1045, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, DEAD-Box RNA Helicase (DEAD5 or DEAD19), 1581, ISWLI/SNF, Chromosomal Assembly Protein (XCAP-C), DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Valyl-tRNA Synthase, 40S Ribosomal Protein S5, TCP-1 Alpha, TCP-1 Beta, Translation Elongation Factor eEF1 Alpha, 1257, 60S Ribosomal Protein L24, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, ARS2, BAF53a, Histone Deacetylase, Protein Phosphatase 1 Nuclear Targeting Subunit protein (PNUTS), Ornithine Decarboxylase, 1447, 1262, 994, or TAFII-55 sequence in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop mental retardation. In the alternative, the alteration may be an increase or a decrease in the expression or activity of a nucleic acid molecule including the sequence of SEQ ID NO:7, 21, 25, 67, 69, 71, 79, 81, 83, 85, 89, 91, 93, 97, 101, 103, 105, 107, 111, 113, 115, 117, 119, 121, 123, 127, 129, 131, 133, 135, 137, 139, 143, 145, 149, 151, 155, or 157.

[0198] In a further aspect, the invention features a method for identifying a candidate compound for the treatment of mental retardation. This method includes the steps of: (a) contacting a polypeptide including the amino acid sequence of SEQ ID NO:8, 20, 22, 26, 68, 70, 72, 80, 82, 84, 86, 90, 92, 94, 98, 102, 104, 106, 108, 112, 114, 116, 118, 120, 122, 124, 128, 130, 132, 134, 136, 138, 140, 144, 146, 150, 152, 156, or 158 with a candidate compound, and (b) detecting an interaction of the polypeptide with the candidate compound, where a candidate compound that binds the polypeptide ma be used to treat mental retardation.

[0199] In a further aspect, the invention features a method for diagnosing a muscle defect or the propensity to develop a muscle defect in an organism. This method includes detecting an alteration in the level of 428 or Denticleless polypeptide expression in a sample derived from a first organism and comparing the level of expression to that of a 428 or Denticleless polypeptide in a sample derived from a second, control organism, where an alteration in the level of expression or activity of the 428 or Denticleless polypeptide in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a muscle defect.

[0200] In a further aspect, the invention features a method for diagnosing a muscle defect or the propensity to develop a muscle defect in an organism. This method includes detecting an alteration in the sequence, or a portion of the sequence, of a 428 or Denticleless nucleic acid molecule in a sample derived from a first organism and comparing the sequence to that of a 428 or Denticleless nucleic acid molecule derived from a second, control organism, where an alteration of the 428 or Denticleless sequence in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a muscle defect. In the alternative, the alteration may be an increase or a decrease in the expression or activity of a nucleic acid molecule including the sequence of SEQ ID NO:49 or 73.

[0201] In a further aspect, the invention features a method for identifying a candidate compound for the treatment of a muscle defect. This method includes the steps of: (a) contacting a polypeptide including the amino acid sequence of SEQ ID NO:50 or 74 with a candidate compound, and (b) detecting an interaction of the polypeptide with the candidate compound, where a candidate compound that binds the polypeptide may be used to treat a muscle defect.

[0202] In a further aspect, the invention features a method for diagnosing a neurodegenerative disorder or the propensity to develop a neurodegenerative disorder in an organism. This method includes detecting an alteration in the level of Vesicular Integral Membrane Protein VIP 36, 297, 40S Ribosomal Protein S5, V-ATPase 16 kDa Proteolytic Subunit, Kinesin Related Motor Protein EG5, 299, 821-02, Valyl-tRNA Synthase, or Translation Elongation Factor eEF1 Alpha polypeptide expression in a sample derived from a first organism and comparing the level of expression to that of a Vesicular Integral Membrane Protein VIP 36, 297, 40S Ribosomal Protein S5, V-ATPase 16 kDa Proteolytic Subunit, Kinesin Related Motor Protein EG5, 299, 821-02, Valyl-tRNA Synthase, or Translation Elongation Factor eEF1 Alpha polypeptide in a sample derived from a second, control organism, where an alteration in the level of expression or activity of the Vesicular Integral Membrane Protein VIP 36, 297, 40S Ribosomal Protein S5, V-ATPase 16 kDa Proteolytic Subunit, Kinesin Related Motor Protein EG5, 299, 821-02, Valyl-tRNA Synthase, or Translation Elongation Factor eEF1 Alpha polypeptide in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a neurodegenerative disorder.

[0203] In a further aspect, the invention features a method for diagnosing a neurodegenerative disorder or the propensity to develop a neurodegenerative disorder in an organism. This method includes detecting an alteration in the sequence of a Vesicular Integral Membrane Protein VIP 36, 297, 40S Ribosomal Protein S5, V-ATPase 16 kDa Proteolytic Subunit, Kinesin Related Motor Protein EG5, 299, 821-02, Valyl-tRNA Synthase, or Translation Elongation Factor eEF1 Alpha nucleic acid molecule in a sample derived from a first organism and comparing the sequence to that of a Vesicular Integral Membrane Protein VIP 36,297, 40SRibosomal Protein S5, V-ATPase 16 kDa Proteolytic Subunit, Kinesin Related Motor Protein EG5, 299, 821-02, Valyl-tRNA Synthase, or Translation Elongation Factor eEF1 Alpha nucleic acid molecule derived from a second, control organism, where an alteration of the Vesicular Integral Membrane Protein VIP 36, 297, 40S Ribosomal Protein S5, V-ATPase 16 kDa Proteolytic Subunit, Kinesin Related Motor Protein EG5, 299, 821-02, Valyl-tRNA Synthase, or Translation Elongation Factor eEF1 Alpha sequence in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a neurodegenerative disorder. In the alternative, the alteration may be an increase or a decrease in the expression or activity of a nucleic acid molecule including the sequence of SEQ ID NO:19, 57, 65, 67, 87, 95, 119, 121, or 127.

[0204] In a further aspect, the invention features a method for identifying a candidate compound for the treatment of a neurodegenerative disorder. This method includes the steps of: (a) contacting a polypeptide including the amino acid sequence of SEQ ID NO:20, 58, 66, 68, 88, 96, 120, 122, or 128 with a candidate compound, and (b) detecting an interaction of the polypeptide with the candidate compound, where a candidate compound that binds the polypeptide may be used to treat a neurodegenerative disorder.

[0205] In a further aspect, the invention features a method for diagnosing stroke or the propensity to develop stroke in an organism. This method includes detecting an alteration in the level of 1463 polypeptide expression in a sample derived from a first organism and comparing the level of expression to that of a 1463 polypeptide in a sample derived from a second, control organism, where an alteration in the level of expression or activity of the 1463 polypeptide in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop stroke.

[0206] In a further aspect, the invention features a method for diagnosing stroke or the propensity to develop stroke in an organism. This method includes detecting an alteration in the sequence, or a portion of the sequence of a 1463 nucleic acid molecule in a sample derived from a first organism and comparing the sequence to that of a 1463 nucleic acid molecule derived from a second, control organism, where an alteration of the 1463 sequence in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop stroke. In the alternative, the alteration may be an increase or a decrease in the expression or activity of a nucleic acid molecule including the sequence of SEQ ID NO:157.

[0207] In a further aspect, the invention features a method for identifying a candidate compound for the treatment of stroke. This method includes the steps of: (a) contacting a polypeptide including the amino acid sequence of SEQ ID NO:158 with a candidate compound, and (b) detecting an interaction of the polypeptide with the candidate compound, where a candidate compound that binds the polypeptide may be used to treat stroke.

[0208] In a further aspect, the invention features a method for diagnosing a stem cell regeneration disorder or the propensity to develop a stem cell regeneration disorder in an organism. This method includes detecting an alteration in the level of 1447 polypeptide expression in a sample derived from a first organism and comparing the level of expression to that of a 1447 polypeptide in a sample derived from a second, control organism, where an alteration in the level of expression or activity of the 1447 polypeptide in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a stem cell regeneration disorder.

[0209] In a further aspect, the invention features a method for diagnosing a stem cell regeneration disorder or the propensity to develop a stem cell regeneration disorder in an organism. This method includes detecting an alteration in the sequence, or a portion of the sequence of a 1447 nucleic acid molecule in a sample derived from a first organism and comparing the sequence to that of a 1447 nucleic acid molecule derived from a second, control organism, where an alteration of the 1447 sequence in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a stem cell regeneration disorder. In the alternative, the alteration may be an increase or a decrease in the expression or activity of a nucleic acid molecule including the sequence of SEQ ID NO:143.

[0210] In a further aspect, the invention features a method for identifying a candidate compound for the treatment of a stem cell regeneration disorder. This method includes the steps of: (a) contacting a polypeptide including the amino acid sequence of SEQ ID NO: 144 with a candidate compound, and (b) detecting an interaction of the polypeptide with the candidate compound, where a candidate compound that binds the polypeptide may be used to treat a stem cell regeneration disorder.

[0211] In a further aspect, the invention features a method for diagnosing a visual defect or the propensity to develop a visual defect in an organism. This method includes detecting an alteration in the level of V-ATPase SFD subunit, V-ATPase 16 kDa Proteolytic Subunit, V-ATPase Alpha Subunit, 1463, VPSP18, or 297 polypeptide expression in a sample derived from a first organism and comparing the level of expression to that of a V-ATPase SFD subunit, V-ATPase 16 kDa Proteolytic Subunit, V-ATPase Alpha Subunit, 1463, VPSP18, Ribonucleotide Reductase R1 Class 1, Telomeric Repeat Factor 2, SIL, ISWI/SNF2 or 297 polypeptide in a sample derived from a second, control organism, where an alteration in the level of expression or activity of the V-ATPase SFD subunit, V-ATPase 16 kDa Proteolytic Subunit, V-ATPase Alpha Subunit, 1463, VPSP18, Ribonucleotide Reductase R1 Class 1, Telomeric Repeat Factor 2, SIL, ISWI/SNF2 or 297 polypeptide in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a visual defect.

[0212] In a further aspect, the invention features a method for diagnosing a visual defect or the propensity to develop a visual defect in an organism. This method includes detecting an alteration in the sequence, or a portion of the sequence of a V-ATPase SFD subunit, V-ATPase 16 kDa Proteolytic Subunit, V-ATPase Alpha Subunit, 1463, VPSP18, Ribonucleotide Reductase R1 Class 1, Telomeric Repeat Factor 2, SIL, ISWI/SNF2 or 297 nucleic acid molecule in a sample derived from a first organism and comparing the sequence to that of a V-ATPase SFD subunit, V-ATPase 16 kDa Proteolytic Subunit, V-ATPase Alpha Subunit, 1463, VPSP18, Ribonucleotide Reductase R1 Class 1, Telomeric Repeat Factor 2, SIL, ISWI/SNF2 or 297 nucleic acid molecule derived from a second, control organism, where an alteration of the V-ATPase SFD subunit, V-ATPase 16 kDa Proteolytic Subunit, V-ATPase Alpha Subunit, 1463, VPSP18, Ribonucleotide Reductase R1 Class 1, Telomeric Repeat Factor 2, SIL, ISWI/SNF2 or 297 sequence in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a visual defect. In the alternative, the alteration may be an increase or a decrease in the expression or activity of a nucleic acid molecule including the sequence of SEQ ID NO:15, 17, 19, 21, 55, 79, 81, 87, 111, or 157.

[0213] In a further aspect, the invention features a method for identifying a candidate compound for the treatment of a visual defect. This method includes the steps of: (a) contacting a polypeptide including the amino acid sequence of SEQ ID NO:16, 18, 20, 22, 56, 80, 82, 88, 112, or 158 with a candidate compound, and (b) detecting an interaction of the polypeptide with the candidate compound, where a candidate compound that binds the polypeptide may be used to treat a visual defect.

[0214] In a further aspect, the invention features a method for diagnosing a pulmonary disorder or the propensity to develop a pulmonary disorder in an organism. This method includes detecting an alteration in the level of Fibroblast Isoform of the ADP/ATP Carrier Protein or TAFII-55 polypeptide expression in a sample derived from a first organism and comparing the level of expression to that of a Fibroblast Isoform of the ADP/ATP Carrier Protein or TAFII-55 polypeptide in a sample derived from a second, control organism, where an alteration in the level of expression or activity of the Fibroblast Isoform of the ADP/ATP Carrier Protein or TAFII-55 polypeptide in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a pulmonary disorder.

[0215] In a further aspect, the invention features a method for diagnosing a pulmonary disorder or the propensity to develop a pulmonary disorder in an organism. This method includes detecting an alteration in the sequence, or a portion of the sequence, of a Fibroblast Isoform of the ADP/ATP Carrier Protein or TAFII-55 nucleic acid molecule in a sample derived from a first organism and comparing the sequence to that of a Fibroblast Isoform of the ADP/ATP Carrier Protein or TAFII-55 nucleic acid molecule derived from a second, control organism, where an alteration of the Fibroblast Isoform of the ADP/ATP Carrier Protein or TAFII-55 sequence in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a pulmonary disorder. In the alternative, the alteration may be an increase or a decrease in the expression or activity of a nucleic acid molecule including the sequence of SEQ ID NO:153 or 155.

[0216] In a further aspect, the invention features a method for identifying a candidate compound for the treatment of a pulmonary disorder. This method includes the steps of: (a) contacting a polypeptide including the amino acid sequence of SEQ ID NO:154 or 156 with a candidate compound, and (b) detecting an interaction of the polypeptide with the candidate compound, where a candidate compound that binds the polypeptide may be used to treat a pulmonary disorder.

[0217] In a further aspect, the invention features a method for diagnosing a movement disorder or the propensity to develop a movement disorder in an organism. This method includes detecting an alteration in the level of 40S Ribosomal Protein S5 polypeptide expression in a sample derived from a first organism and comparing the level of expression to that of a 40S Ribosomal Protein S5 polypeptide in a sample derived from a second, control organism, where an alteration in the level of expression or activity of the 40S Ribosomal Protein S5 polypeptide in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a movement disorder.

[0218] In a further aspect, the invention features a method for diagnosing a movement disorder or the propensity to develop a movement disorder in an organism. This method includes detecting an alteration in the sequence, or a portion of the sequence, of a 40S Ribosomal Protein S5 nucleic acid molecule in a sample derived from a first organism and comparing the sequence to that of a 40S Ribosomal Protein S5 nucleic acid molecule derived from a second, control organism, where an alteration of the 40S Ribosomal Protein SS sequence in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a movement disorder. In the alternative, the alteration may be an increase or a decrease in the expression or activity of a nucleic acid molecule including the sequence of SEQ ID NO:121.

[0219] In a further aspect, the invention features a method for identifying a candidate compound for the treatment of a movement disorder. This method includes the steps of: (a) contacting a polypeptide including the amino acid sequence of SEQ ID NO:122 with a candidate compound, and (b) detecting an interaction of the polypeptide with the candidate compound, where a candidate compound that binds the polypeptide may be used to treat a movement disorder.

[0220] In a further aspect, the invention features a method for diagnosing a somite formation disorder or the propensity to develop a somite formation disorder in an organism. This method includes detecting an alteration in the level of 428 or Denticleless polypeptide expression in a sample derived from a first organism and comparing the level of expression to that of a 428 or Denticleless polypeptide in a sample derived from a second, control organism, where an alteration in the level of expression or activity of the 428 or Denticleless polypeptide in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a somite formation disorder.

[0221] In a further aspect, the invention features a method for diagnosing a somite formation disorder or the propensity to develop a somite formation disorder in an organism. This method includes detecting an alteration in the sequence, or a portion of the sequence, of a 428 or Denticleless nucleic acid molecule in a sample derived from a first organism and comparing the sequence to that of a 428 or Denticleless nucleic acid molecule derived from a second, control organism, where an alteration of the 428 or Denticleless sequence in the first organism relative to the second organism is indicative of the first organism having or having a propensity to develop a somite formation disorder. In the alternative, the alteration may be an increase or a decrease in the expression or activity of a nucleic acid molecule including the sequence of SEQ ID NO:49 or 73.

[0222] In a further aspect, the invention features a method for identifying a candidate compound for the treatment of a somite formation disorder. This method includes the steps of: (a) contacting a polypeptide including the amino acid sequence of SEQ ID NO:50 or 74 with a candidate compound, and (b) detecting an interaction of the polypeptide with the candidate compound, where a candidate compound that binds the polypeptide may be used to treat a somite formation disorder.

[0223] In a further aspect, the invention features an antibody that specifically binds to a polypeptide described herein and in another aspect, the invention features a zebrafish that includes a mutant nucleic acid molecule described herein.

[0224] Generally, the novel nucleic acid and amino acid sequences described herein may be, for example, naturally-occurring. These nucleic acid and amino acid sequences may be, for example, used in protein-protein interaction assays (e.g., two-hybrid, three-hybrid, and co-immunoprecipitation). In addition, the novel nucleic acid sequences described herein may be used to generate transgenic animals, for example, zebrafish, mice, and rats. Furthermore, such transgenic animals may be used in whole animal assays, such as assays to identify candidate compounds potentially useful for treating a disease or disorder. Moreover, these novel nucleic acid and amino acid molecules may be used, for instance, to generate probes and primers, as well as anti-sense nucleic acid sequences complementary to a novel nucleic acid sequence described herein that may be used to inhibit the biological activity of the nucleic acid and amino acid sequences described herein, regardless of the length of the anti-sense nucleic acid sequence. These antisense nucleic acid sequences may be used to treat disease and may also be used to form pharmaceutical compositions.

[0225] Definitions

[0226] By a “459 protein,” or a “459 polypeptide” is meant a polypeptide that has at least 72% amino acid sequence identity to the zebrafish 459 amino acid sequence of SEQ ID NO:60 over a region spanning at least 233 contiguous amino acids. Desirably, a “459 protein” or a “459 polypeptide” is at least 80%, 85%, 90%, 95%, 97%, 99%, or even 100% identical to the sequence of SEQ ID NO:60 over at least 150, 175, 200, or 233 contiguous amino acids. Polypeptides encoded by splice variants of 459 nucleic acid sequences, as well as by 459 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus at nucleotide 210 of a 459 nucleic acid sequence (e.g., SEQ ID NO:59) are also included in this definition. A “459 protein” or a “459 polypeptide,” as referred to herein, plays a role in kidney development and in cell death during development. In zebrafish, the loss of, or an alteration in, a 459 polypeptide in a cell may result in the development of a cyst in a kidney, a bent body shape, or in the appearance of apoptotic cells in the Central Nervous System (CNS). Accordingly, a “459 protein” or a “459 polypeptide” may be used as a marker for, or to prevent or treat, for example, a kidney disorder, for example, polycystic kidney disease, multicystic kidney disease, or malformation of the kidney; a neurodegenerative disease characterized by excess cell death, for example, Alzheimer's disease, Parkinson's disease, or spinocerebellar ataxia; or may be used for the treatment of a proliferative disorder, such as cancer.

[0227] By a “459 nucleic acid sequence” is meant a nucleic acid sequence that has at least 74%, 80%, 85%, 90%, 95%, or 98% identity to the zebrafish 459 nucleic acid sequence of SEQ ID NO:59 over at least 250, 584, 700, 800, 900, 1000, 1500, 2000, or 2500 contiguous nucleotides. In a desirable embodiment, a “459 nucleic acid sequence” is identical to the sequence of SEQ ID NO:59. Nucleic acid molecules consisting of splice variants of 459 nucleic acid sequences, as well as 459 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 210 of SEQ ID NO:59, are also included in this definition. By a “459 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a 459 polypeptide or a zebrafish 459 polypeptide (e.g., SEQ ID NO:60), or a fragment thereof, as defined above. A mutation in a 459 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 459 expression or function, including, as examples, null mutations and mutations causing truncations.

[0228] In zebrafish, an alteration in a 459 nucleic acid sequence in a cell may result in the development of a cyst in a kidney, a bent body shape, or in the appearance of apoptotic cells in the Central Nervous System (CNS). Accordingly, a “459 nucleic acid sequence” may be used as a marker for, or to prevent or treat, for example, a kidney disorder, for example, polycystic kidney disease, multicystic kidney disease, or malformation or abnormal development of the kidney; a neurodegenerative disease characterized by excess cell death, for example, Alzheimer's disease, Parkinson's disease, or spinocerebellar ataxia; or may be used for the treatment of a proliferative disorder, such as cancer.

[0229] By a “904 protein” or a “904 polypeptide” is meant a polypeptide that has at least 92%, 95%, 97%, or 99% amino acid sequence identity over at least 75, 100, or 150 contiguous amino acids to the zebrafish 904 polypeptide sequence of SEQ ID NO:2. Desirably, a “904 protein” or a “904 polypeptide” is at least 92%, 95%, 97%, or even 100% identical to the sequence of SEQ ID NO:2. Polypeptides encoded by splice variants of 904 nucleic acid sequences, as well as by 904 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 1315 of SEQ ID NO:1, are also included in this definition. A “904 protein” or a “904 polypeptide,” as referred to herein, plays a role in brain development, compartmentalization, or function. The loss of a 904 polypeptide in a cell may result in an overgrowth of neural tissue, increased vascularization, and in brain hemorrhages. Accordingly, a “904 protein” or a “904 polypeptide” may be used as a marker for, or to prevent or treat, for example, a proliferative disease, such as cancer or neuroblastoma, or a circulatory disorder, such as stroke.

[0230] By a “POU2 protein” or a “POU2 polypeptide” is meant a polypeptide that has at least 37%, 40%, 45%, 50%, 60%, 75%, 90%, 95%, or 99% amino acid sequence identity to the zebrafish POU2 polypeptide sequence of SEQ ID NO:4 over a region spanning at least 100, 200, 300, 350 contiguous amino acids. In one desirable embodiment, a “POU2 protein” or a “POU2 polypeptide” is identical to the sequnce of SEQ ID NO:4. Polypeptides encoded by splice variants of POU2 nucleic acid sequences, as well as by POU2 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 653 or 1088 of SEQ ID NO:3, are also included in this definition. A “POU2 protein” or a “POU2 polypeptide,” as referred to herein, plays a role in neural development, e.g., mid-brain and hind-brain development, as well as, e.g., in zebrafish, in the development of the otolith, or the hair cells of the otolith. Accordingly, a “POU2 protein” or a “POU2” polypeptide” may be used as a marker for, or to prevent or treat, congenital hearing or a sensory disorder, such as Usher syndrome or Waardenburg syndrome, or a congenital brain defect that causes mental retardation, such as Down's syndrome, Chiari Malformation, colpocephaly, holoprosencephaly, lissencephaly, or megalencephaly.

[0231] By a “40S Ribosomal Protein S18protein” or a “40S Ribosomal Protein S18 polypeptide” is meant a polypeptide that has at least 97%, 98%, or at least 99% amino acid sequence identity to the zebrafish 40S ribosomal protein S118 sequence of SEQ ID NO:6 over a region spanning at least 75, 100, 115, or 152 contiguous amino acids. Desirably, a “40S Ribosomal Protein S18 protein” or “40S Ribosomal Protein S18 polypeptide” is meant a polypeptide that is identical to the sequence of SEQ ID NO:6. Polypeptides encoded by splice variants of 40S Ribosomal Protein S18 nucleic acid sequences, as well as by 40S Ribosomal Protein S18 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 220 and 221 of SEQ ID NO:5, are also included in this definition. A. “40S Ribosomal Protein S18 protein” or a “40S Ribosomal Protein S18 polypeptide,” as referred to herein, plays a role in cell proliferation, e.g., neuronal proliferation. For example, in zebrafish, a “40S Ribosomal Protein S18 protein” or a “40S Ribosomal Protein S18 polypeptide,” as referred to herein, plays a role in brain development, neural organization, or compartmentalization, or eye development. In addition, the loss of a zebrafish 40S Ribosomal Protein S18 polypeptide in a cell may result in a kinked tail, a reduced forebrain, and a bigger hind-brain. Accordingly, a “40S Ribosomal Protein S18 protein” or a “40S Ribosomal Protein S18 polypeptide” may be used as a marker for, or to prevent or treat, for example, a proliferative disorder such as cancer or neuroblastoma, or an eye malformation syndrome, such as oculorenal syndrome or Reiger syndrome, or a congenital brain defect that causes mental retardation, such as microencephaly, anencephaly, Down's syndrome, Chiari Malformation, Colpocephaly, holoprosencephaly, lissencephaly, or megalencephaly.

[0232] By a U2AF protein” or a “U2AF polypeptide” is meant a polypeptide that has at least 92%, 95%, 97%, or 99% amino acid sequence identity to the zebrafish splicing factor U2AF amino acid sequence of SEQ ID NO:8 over a region spanning at least 150, 175, 200, 225, or 250 contiguous amino acids. Desirably, a “U2AF protein” or a “U2AF polypeptide” is identical to the sequence of SEQ ID NO:8. Polypeptides encoded by splice variants of U2AF nucleic acid sequences, as well as U2AF nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 46 and 47 of the sequence of SEQ ID NO:7, are also included in this definition. A “U2AF protein” or a “U2AF polypeptide,” as referred to herein, plays a role in brain development, particularly in the tectum. The loss of a U2AF polypeptide in a cell may result in brain necrosis, particularly in the tectum. Accordingly, a “U2AF protein,” or a “U2AF polypeptide” may be used as a marker for, or to prevent or treat, for example, a necrotizing brain disorder, such as Leigh's disease, or subacute necrotizing encephalomyelopathy.

[0233] By a “954 protein” or a “954 polypeptide” is meant a polypeptide that has at least 93%, 95%, 97%, or 99% amino acid sequence identity to the zebrafish 954 amino acid sequence of SEQ ID NO:10 or SEQ ID NO:159 over a region spanning at least 200, 300, or 334 amino acids. Desirably, a “954 protein” or a “954 polypeptide” is identical to the sequence of SEQ ID NO:10 or SEQ ID NO:159. Polypeptides encoded by splice variants of 954 nucleic acid sequences, as well as 954 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 432 or 506 of the sequence of SEQ ID NO:9, are also included in this definition. A “954 protein” or a “954 polypeptide,” as referred to herein, plays a role in cartilage development. Accordingly, a “954 protein,” or a “954 polypeptide” may be used as a marker for, or to prevent or treat, for example, a connective tissue disorder such as arthritis.

[0234] By a “Neurogenin Related Protein-1,” a “Nrp-1 protein,” a “Nrp-1 polypeptide” or a “Neurogenin Related Protein-1 polypeptide” is meant a polypeptide that is identical to a zebrafish Nrp-1 amino acid sequence, for example, the sequence of SEQ ID NO:12. Polypeptides encoded by splice variants of Nrp-1 nucleic acid sequences, as well as Nrp-1 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 1149 of the sequence of SEQ ID NO:11, are also included in this definition. A “Neurogenin Related Protein-1,” a “Nrp-1 protein,” or a “Neurogenin Related Protein-1 polypeptide,” as referred to herein, plays a role in cell fate determination or in jaw development. The loss of a Nrp-1 polypeptide in a cell may also result in touch insensitivity in the head, a gaping jaw, and motility problems. Accordingly, a “Neurogenin Related Protein-1,” a “Nrp-1 protein,” a “Nrp-1 polypeptide” or a “Neurogenin Related Protein-1 polypeptide” may be used as a marker for, or to prevent or treat, for example, a movement disorder such as Angelman's syndrome, tardive dyskinesia, spasticity, ataxias, or spinocerebellar ataxia; also neurodegenerative disorders, for example, Huntington's disease, or Parkinson's disease, Alzheimer's disease, or a craniofacial defect, such as Apert, Crouzon, Pfeiffer and Saethre-Chotzen Syndromes

[0235] By a “Caudal Protein,” a “Cad-1 protein,” or a “Caudal polypeptide” is meant a polypeptide that is identical to a zebrafish Cad-1 amino acid sequence, for example, the sequence of SEQ ID NO:14. Polypeptides encoded by splice variants of Cad-1 nucleic acid sequences, as well as Cad-1 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 583 and 584 of the sequence of SEQ ID NO:13, are also included in this definition. A “Caudal Protein,” a “Cad-1 protein,” or a “Caudal polypeptide,” as referred to herein, plays a role in trunk and tail development, motility, touch sensitivity and, e.g., in zebrafish, in the formation of the yolk sac extension. Accordingly, a “Caudal Protein,” a “Cad-1 protein,” or a “Caudal polypeptide” may be used as a marker for, or to prevent or treat, for example, a movement disorder such as Angelman's syndrome, tardive dyskinesia, spasticity, ataxias, or spinocerebellar ataxia, or a neurodegenerative disorder, such as Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, or spinocerebellar ataxia.

[0236] By a “V-ATPase Alpha Subunit Protein” or a “V-ATPase Alpha Subunit polypeptide” is meant a polypeptide that has at least 77%, 80%, 85%, 90%, 95%, 97%, or 99% identity to the sequence of SEQ ID NO:16, over at least 150, 175, 200, or 226 amino acids. Desirably, a “V-ATPase Alpha Subunit Protein” or a “V-ATPase Alpha Subunit polypeptide” is identical to the sequence of SEQ ID NO:16. Polypeptides encoded by splice variants of V-ATPase Alpha Subunit nucleic acid sequences (e.g., SEQ ID NO:15), as well as V-ATPase Alpha Subunit nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 169 of the sequence of SEQ ID NO:15, are also included in this definition. A “V-ATPase Alpha Subunit polypeptide,” as referred to herein, plays a role in pigmentation in the body or the eye. Accordingly, a “V-ATPase Subunit Alpha polypeptide” or a “V-ATPase Alpha Subunit protein” may be used as a marker for, or to prevent or treat, for example, a variety of disorders, particularly disorders related to pigmentation, such as human pigmentary glaucoma, Wagner's disease, Cockayne's syndrome, Kearns-Sayre disease, Waardenburg syndrome, Usher's syndrorme, or Multiple lentigines syndrome.

[0237] By a “V-ATPase SFD Subunit protein” or a “V-ATPase SFD Subunit polypeptide” is meant a polypeptide that has at least 89%, 91%, 93%, 95%, 97%, or 99% identity to the zebrafish V-ATPase subunit SFD sequence of SEQ ID NO:18 over at least 100, 150, 200, 250, 300, 400, or 450 contiguous amino acids. Desirably, a “V-ATPase SFD Subunit protein” or a “V-ATPase SFD Subunit polypeptide” is identical to the sequence of SEQ ID NO:18. Polypeptides encoded by splice variants of V-ATPase SFD Subunit nucleic acid sequences, as well as V-ATPase SFD Subunit nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 31-32 of the sequence of SEQ ID NO:17, are also included in this definition. A “V-ATPase SFD Subunit Protein,” a “V-ATP synthase subunit SFD protein,” or a “V-ATPase SFD Subunit polypeptide,” as referred to herein, plays a role in eye and body pigmentation. Accordingly, a “V-ATPase SFD Subunit protein,” or a “V-ATPase SFD Subunit polypeptide” may be used as a marker for, or to prevent or treat, a variety of disorders, such as developmental disorders, particularly disorders related to pigmentation, for example, human pigmentary glaucoma, Wagner's disease, Cockayne's syndrome, Kearns-Sayre disease, Waardenburg syndrome, Usher's syndrome, or Multiple lentigines syndrome.

[0238] By a “V-ATPase 16 kDa Proteolytic Subunit protein” or a “V-ATPase 16 kDa Proteolytic Subunit polypeptide” is meant a polypeptide that has at least 91%, 93%, 95%, or 99% identity to the zebrafish V-ATPase 16 kDa Proteolytic Subunit amino acid sequence of SEQ ID NO:20 over at least 50, 75, 100, 125, or 150 contiguous amino acids. Desirably, a “V-ATPase 16 kDa Proteolytic Subunit protein” or a “V-ATPase 16 kDa Proteolytic Subunit polypeptide” is idenical to the sequence of SEQ ID NO:20. Polypeptides encoded by splice variants of VATPase 16 kDa Proteolytic Subunit nucleic acid sequences, as well as V-ATPase 16 kDa Proteolytic Subunit nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 242-243 of the sequence of SEQ ID NO:19, are also included in this definition. The V-ATPase 16 kDa Proteolytic Subunit nucleic acid sequence plays a role in body and eye pigmentation, and also touch sensitivity. Accordingly, a “V-ATPase 16 kDa Proteolytic Subunit protein,” or a “V-ATPase 16 kDa Proteolytic Subunit polypeptide” may be used as a marker for, or to prevent or treat, for example, a disorder related to pigmentation, such as human pigmentary glaucoma, Wagner's disease, Cockayne's syndrome, Kearns-Sayre disease, Waardenburg syndrome, Usher's syndrome, or Multiple lentigines syndrome, or a neurodegenerative disease, such as Alzheimer's disease, Huntington's disease, Parkinson's disease, or spinocerebellar ataxia.

[0239] By a “1463 protein” or a “1463 polypeptide” is meant a polypeptide that has at least 44%, 50%, 60%, 70%, 80%, 90%, or 95% identity to the zebrafish 1463 polypeptide sequence of SEQ ID NO:158 over at least 100, 150, 200, 250, 300, 350, 400, 450, or 500 contiguous amino acids. Desirably, a “1463 protein” or a “1463 polypeptide” is identical to the sequence of SEQ ID NO:158. Polypeptides encoded by splice variants of 1463 nucleic acid sequences, as well as 1463 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 389 and 390 of the nucleic acid sequence of SEQ ID NO:157 as referred to herein, are also included in this definition. 1463 plays a role in body pigmentation, brain morphogenesis, or angiogenesis. For example, in zebrafish, the loss of a 1463 polypeptide in a cell results in brain dysmorphia, a shortened hind-brain and swollen tectum, or a defect in body pigmentation. Accordingly, a “1463 protein,” or a “1463 polypeptide” may be used as a marker for, or to prevent or treat, for example, disorders related to pigmentation, such as human pigmentary glaucoma, Wagner's disease, Cockayne's syndrome, Kearns-Sayre disease, Waardenburg syndrome, Usher's syndrome, or multiple lentigines syndrome, or a developmental neurological disorder, such as autism, or a congenital brain defect that causes mental retardation, such as Down's syndrome, Chiari Malformation, Colpocephaly, holoprosencephaly, lissencephaly, or megalencephaly, or a circulatory disorder, such as stroke.

[0240] By a “Vacuolar Sorting Protein 18,” “VPSP18 protein,” or a “VPSP18 polypeptide” is meant a polypeptide that has at least 65%, 70%, 80%, 90%, or 95% identity to the zebrafish VPSP18 sequence of SEQ ID NO:22 over a region spanning at least 500, 600, 700, 800, 900, or 974 contiguous amino acids. Desirably, a “VPSP18 protein” or a “VPSP18 polypeptide” is identical to the sequence of SEQ ID NO:22. Polypeptides encoded by splice variants of VPSP18 nucleic acid sequences, as well as polypeptides encoded by VPSP18 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 2336 of the sequence of SEQ ID NO:21 as referred to herein, are also included in this definition. VPSP18 plays a role in pigmentation, iridophore development, or tectum development. Accordingly, a “VPSP18 protein,” or a “VPSP18 polypeptide” may be used as a marker for, or to prevent or treat, for example, disorders related to pigmentation, such as human pigmentary glaucoma, Wagner's disease, Cockayne's syndrome, Kearns-Sayre disease, or multiple lentigines syndrome, or a sensory disorder, such as Waardenburg syndrome, or Usher's syndrome, or a developmental neurological disorder, for example, autism, or a retinal disorder, such as retinitis, macular degeneration, Friedreich's ataxia, or Laurence-Moon syndrome.

[0241] By a “Pescadillo protein” or a “Pescadillo polypeptide” is meant a polypeptide encoded by a Pescadillo gene (e.g., GenBank Accession No. U77627). Polypeptides encoded by splice variants of Pescadillo nucleic acid sequences, as well as Pescadillo nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at approximately nucleotide 20 of the Pescadillo nucleic acid sequence of GenBank Accession No. U77627, are also included in this definition. In addition, a “Pescadillo protein,” or a “Pescadillo polypeptide,” as referred to herein, plays a role in embryonic organ formation and cell cycle checkpoints. Accordingly, a “Pescadillo protein,” or a “Pescadillo polypeptide” may be used as a marker for, or to prevent or treat, for example, a proliferative disorder, such as cancer or neuroblastoma.

[0242] By a “HNF1-β/vHNF1 protein,” or a “HNF1-β/vHNF1 polypeptide” is meant a polypeptide that has at least 80%, 85%, 90%, 95%, or 99% amino acid sequence identity to the zebrafish HNF1-β/vHNF1 polypeptide sequence of SEQ ID NO:24 iver at keast 300, 350, 400, 500, or 550 contiguous amino acids. Desirably, a “HNF1-β/vHNF1 protein” or a “HNF1-β/vHNF1 polypeptide” is identical to the sequence of SEQ ID NO:24. Polypeptides encoded by splice variants of HNF1-β/vHNF1 nucleic acid sequences, as well as by HNF1-β/vHNF1 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 1682 and 1683 of the sequence of SEQ ID NO:23, or at nucleotide 361 or 745, are also included in this definition. A “HNF1-β/vHNF1 protein” or a “HNF1-β/vHNF1 polypeptide,” as referred to herein, plays a role in kidney or pancreas development, as well as in patterning the hind-brain. Accordingly, a “HNF1-β/vHNF1 protein,” or a “HNF1-β/vHNF1 polypeptide” may be used a marker for, or to prevent or treat, for example, a pancreatic or kidney disorder, such as diabetes, polycystic kidney disease, or Bardet-Biedl syndrome, or a congenital brain defect that causes mental retardation, such as Down's syndrome, Chiari Malformation, Colpocephaly, Holoprosencephaly, Lissencephaly, or Megalencephaly.

[0243] By a “60S Ribosomal L35 protein,” or a “60S Ribosomal L35 polypeptide” is meant a polypeptide that has at least 92%, 95%, 97%, or 99% amino acid sequence identity to the zebrafish 60S Ribosomal L35 polypeptide over at least 50, 75, 100, or 123 contiguous amino acids encoded by the 60S ribosomal L35 polypeptide sequence of SEQ ID NO:26. Desirably, a “60S Ribosomal L35 protein” or a “60S Ribosomal L35 polypeptide” is identical to the sequence of SEQ ID NO:26. Polypeptides encoded by splice variants of 60S Ribosomal L35 nucleic acid sequences, as well as by 60S Ribosomal L35 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 30 and 31 of the sequence of SEQ ID NO:25, are also included in this definition. A “60S Ribosomal L35 protein” or a “60S Ribosomal L35 polypeptide,” as referred to herein, plays a role in brain, head and eye development. A 60S Ribosomal protein may also be involved in the development of somite boundaries. Accordingly, a “60S Ribosomal L35 protein” or a “60S Ribosomal L35 polypeptide” may be used as a marker for, or to prevent or treat, for example, brain and/or eye disorders, sach as Balci's syndrome, or a congenital brain defect that causes mental retardation, such as microencephaly, anencephaly, Down's syndrome, Chiari Malformation, Colpocephaly, holoprosencephaly, lissencephaly, or megalencephaly, or muscle disorders, for example, a congenital muscular dystrophy disorder.

[0244] By a “60S Ribosomal L44 protein,” or a “60S Ribosomal L44 polypeptide” is meant a polypeptide that has at least 98% or 99% amino acid sequence identity to the zebrafish 60S Ribosomal L44 polypeptide sequence of SEQ ID NO:28 over at least 50, 75, or 106 amino acids. Desirably, a “60S Ribosomal L44 protein” or a “60S Ribosomal L44 polypeptide” is identical to the sequence of SEQ ID NO:28. Polypeptides encoded by splice variants of 60S Ribosomal L44 nucleic acid sequences, as well as by 60S Ribosomal L44 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 195 and 196 of the nucleic acid sequence of SEQ ID NO:27, are also included in this definition. A “60S Ribosomal L44 protein” or a “60S Ribosomal L44 polypeptide,” as referred to herein, plays a role in brain development and, in zebrafish, in the formation of the yolk sac extension. Accordingly, the loss of a 60S Ribosomal L44 polypeptide in a cell may result in an organism with large brain ventricles and a defective yolk. Consequently, a “60S Ribosomal L44 protein” or a “60S Ribosomal L44 polypeptide” may be used as a marker for, or to prevent or treat, for example, a congenital brain defect that causes mental retardation, such as microencephaly, anencephaly, Down's syndrome, Chiari Malformation, Colpocephaly, holoprosencephaly, lissencephaly, or megalencephaly.

[0245] By a “CopZ1 protein,” or a “CopZ1 polypeptide” is meant a polypeptide that is, for example, identical to the sequence of SEQ ID NO:30. Polypeptides encoded by splice variants of copZ1 nucleic acid sequences, as well as by copZ1 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 90 and 91 of the sequence of SEQ ID NO:29, are also included in this definition. A “CopZ1 protein” or a “CopZ1 polypeptide,” as referred to herein, plays a role in maintaining the retina. The loss of a CopZ1 polypeptide in a cell may result in eye degeneration, particularly in the retinal pigmented epithelia. Accordingly, a “CopZ1 protein” or a “CopZ1 polypeptide” may be used as a marker for, or to prevent or treat, for example, retinitis pigmentosa, macular degeneration, Friedreich's ataxia, or Laurence-Moon syndrome.

[0246] By a “215 protein,” or a “215 polypeptide” is meant a polypeptide that has at least 78% amino acid sequence identity over at least 529 amino acids to the zebrafish 215 polypeptide sequence of SEQ ID NO:32. Desirably, a “215 protein” or a “215 polypeptide” is at least 78%, 80%, 85%, 90%, 95%, 97%, or 99% identical to the sequence of SEQ ID NO:31. In a more desirable embodiment, a “215 protein” or a “215 polypeptide” is identical to the sequence of SEQ ID NO:31 over at least 300, 400, 450, or 500 contiguous amino acids. Polypeptides encoded by splice variants of 215 nucleic acid sequences, as well as by 215 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 294 and 295 of the sequence of SEQ ID NO:31, are also included in this definition. A “215 protein” or a “215 polypeptide,” as referred to herein, plays a role in eye and jaw development. For example, in zebrafish, the loss of a 215 polypeptide in a cell may result in eyes that are at least 75% smaller than those of three day-old wild-type zebrafish, or a jaw that is at least 75% reduced when compared to that of a three day old wild-type zebrafish. Alternatively, the loss of a 215 protein may result in general underdevelopment or a bent ceratohyal. Accordingly, a “215 protein” or a “215 polypeptide” may be used as a marker for, or to prevent or treat, for example, a craniofacial defect, such as Apert, Crouzon, Pfeiffer and Saethre-Chotzen Syndromes, or an eye malformation syndrome, such as oculorenal syndrome or Reiger syndrome.

[0247] By a “307 protein,” or a “307 polypeptide” is meant a polypeptide that has at least 54% amino acid sequence identity over at least 199 amino acids to the zebrafish 307 polypeptide sequence of SEQ ID NO:34. Desirably, a “307 protein” or a “307 polypeptide” is at least 54%, 60%, 70%, 75%, 80%, 85%, 90%, or 95% identical to the sequence of SEQ ID NO:34 over at least 75, 100, 125, 150, 175, or 199 contiguous amino acids. In a more desirable embodiment, a “307 protein” or a “307 polypeptide” is identical to the sequence of SEQ ID NO:34. Polypeptides encoded by splice variants of 307 nucleic acid sequences, as well as by 307 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 176 of the sequence of SEQ ID NO:33, are also included in this definition. A “307 protein” or a “307 polypeptide,” as referred to herein, plays a role in cartilage or jaw development. In zebrafish, the loss of a 307 polypeptide in a cell may result in mutants with mandibular arches that fail to extend anteriorly, and branchial arches that are slightly misshapen when compared to wild-type zebrafish. Accordingly, a “307 protein” or a “307 polypeptide” may be used as a marker for, or to prevent or treat, for example, a craniofacial disorder, such as Apert, Crouzon, Pfeiffer or Saethre-Chotzen Syndromes, or a connective tissue disease, such as rheumatoid arthritis.

[0248] By a “572 protein,” or a “572 polypeptide” is meant a polypeptide that has at least 37% amino acid sequence identity over at least 196 amino acids to the zebrafish 572 polypeptide sequence of SEQ ID NO:36. Desirably, a “572 protein” or a “572 polypeptide” is at least 37%, 40%, 50%, 60%, 70%, 80%, 85%, 90%, 95%, 97%, or 99% identical to the sequence of SEQ ID NO:36 over at least 100, 125, 150, 175, or 196 contiguous amino acids. In a more desirable embodiment, a “572 protein,” or a “572 polypeptide” is identical to the sequence of SEQ ID NO:36. Polypeptides encoded by splice variants of 572 nucleic acid sequences, as well as by 572 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 277 of SEQ ID NO:35, are also included in this definition. A zebrafish “572 protein” or a “572 polypeptide,” as referred to herein, plays a role in jaw and branchial arch development. The loss of a zebrafish 572 polypeptide in a cell may result in mutants with shorter jaws and fragmented branchial arches when compared to wild-type zebrafish. Accordingly, a “572 protein” or a “572 polypeptide” may be used as a marker for, or to prevent or treat, for example, a connective tissue disorder, such as arthritis, rheumatoid arthritis, or juvenile rheumatoid arthritis, or a craniofacial defect, such as Apert, Crouzon, Pfeiffer or Saethre-Chotzen Syndromes.

[0249] By a “1116A protein,” or a “1116A polypeptide” is meant a polypeptide that has at least 42% amino acid sequence identity over at least 191 contiguous amino acids amino acids to the zebrafish 1116A polypeptide sequence of SEQ ID NO:38. Desirably, a “1116A protein” or a “1116A polypeptide” is at least 45%, 50%, 60%, 70%, 80%, 85%, 90%, or 95% identical to the sequence of SEQ ID NO:38 over at least 75, 100, 125, 150, 175, or 191 contiguous amino acids. In a more desirable embodiment a “1116A protein,” or a “1116A polypeptide” is identical to the sequence of SEQ ID NO:38. Polypeptides encoded by splice variants of 1116A nucleic acid sequences, as well as by 1116A nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 135 of the 1116A nucleic acid sequence of SEQ ID NO:37 are also included in this definition. A “1116A protein” or a “1116A polypeptide,” as referred to herein, plays a role in jaw development. For example, the loss of a zebrafish 116A polypeptide in a cell may result in failure of the jaw to form in three-day old mutant zebrafish. Accordingly, a “1116A protein” or a “1116A polypeptide” may be used as a marker for, or to prevent or treat, a craniofacial disorder, for example, Apert, Crouzon, Pfeiffer and Saethre-Chotzen Syndromes.

[0250] By a “1548 protein,” or a “1548 polypeptide” is meant a polypeptide that has at least 76%, 80%, 85%, 90%, 95%, or 98% amino acid sequence identity over at least 500, 600, 700, 800, 900, 925, or 950 contiguous amino acids to the zebrafish 1548 polypeptide sequence of SEQ ID NO:40. Desirably, a “1548 protein” or a “1548 polypeptide” is identical to the sequence of SEQ ID NO:40. Polypeptides encoded by splice variants of 1548 nucleic acid sequences, as well as by 1548 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 85 of the 1548 nucleic acid sequence of SEQ ID NO:39 are also included in this definition. A zebrafish “1548 protein” or a “1548 polypeptide,” as referred to herein, plays a role in eye, head, heart, fin, or jaw development. The loss of a zebrafish 1548 polypeptide in a cell may result in an added structure attached to the parachordal of the neurocranium in three day-old mutant zebrafish. Accordingly, a “1548 protein” or a “1548 polypeptide” may be used as a marker for, or to prevent or treat, for example, a neurological disorder, such as Balci syndrome or Angelman syndrome, or a craniofacial defect such as Apert, Crouzon, Pfeiffer or Saethre-Chotzen Syndromes, or a congenital brain defect that causes mental retardation, such as microencephaly, anencephaly, Down's syndrome, Chiari Malformation, Colpocephaly, holoprosencephaly, lissencephaly, or megalencephaly, or a congenital heart defect, such as a ventricular or atrial septal defect, or a limb formation defect, such as achondroplasia, Ellis-van Creveld syndrome, Poland sequence, or Pfeiffer syndrome.

[0251] By a “Casein Kinase 1 α protein,” or a “Casein Kinase 1 α polypeptide” is meant a polypeptide encoded by a Casein Kinase 1 α gene that has at least 99% amino acid sequence identity over at least 275, 300, or 324 amino acids to the zebrafish Casein Kinase 1 α sequence of SEQ ID NO:42. Desirably, a “Casein Kinase 1 α protein” or a “Casein Kinase 1 α polypeptide” is at least 99% or even 100% identical to the sequence of SEQ ID NO:42. Polypeptides encoded by splice variants of Casein Kinase 1 α nucleic acid sequences, as well as by Casein Kinase 1 α nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 730 and 731 of the Casein Kinase 1 α nucleic acid sequence of SEQ ID NO:41 are also included in this definition. A “Casein Kinase 1 α protein” or a “Casein Kinase 1 α polypeptide,” as referred to herein, plays a role in cartilage development. For example, in zebrafish, the loss of a Casein Kinase 1 α polypeptide in a cell may result in the retarded development of pectoral fins in three-day old mutant zebrafish. In addition, these some of these fins may be misshapen. Alcian blue staining shows that the cartilage of the fins, brancial arches, and jaw is wrinkled. Accordingly, a “Casein Kinase 1 α protein” or a “Casein Kinase 1 α polypeptide” may be used as a marker for, or to prevent or treat, a limb formation defect, for example, achondroplasia, Ellis-van Creveld syndrome, Poland sequence, or Pfeiffer syndrome, or a cartilage or connective tissue disease, such as arthritis, juvenile rheumatoid arthritis, or Marfan's syndrome, or a craniofacial defect such as Apert, Crouzon, Pfeiffer or Saethre-Chotzen Syndromes.

[0252] By a “Nodal-Related or Squint protein,” or a “Nodal-Related or Squint polypeptide” is meant a polypeptide that has at least 42% amino acid sequence identity over at least 381 amino acids to the zebrafish “Nodal-Related or Squint protein sequence of SEQ ID NO:44. Desirably, a “Nodal-Related or Squint protein” or a “Nodal-Related or Squint polypeptide” is at least 43%, 50%, 60%, 70%, 80%, 90%, or even 100% identical to the sequence of SEQ ID NO:44. Polypeptides encoded by splice variants of Nodal-Related or Squint nucleic acid sequences, as well as by Nodal-Related or Squint nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 654 of the Nodal-Related or Squint nucleic acid sequence of SEQ ID NO:43 are also included in this definition.

[0253] By a “Smoothened protein,” or a “Smoothened polypeptide” is meant a polypeptide that is, for example, identical to the zebrafish Smoothened amino acid sequence of SEQ ID NO:46. Polypeptides encoded by splice variants of Smoothened nucleic acid sequences, as well as by Smoothened nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 271 the Smoothened nucleic acid sequence of SEQ ID NO:45 are also included in this definition. A “Smoothened protein” or a “Smoothened polypeptide,” as referred to herein, plays a role in in the development of motoneurons, forebrain and midbrain commissures, body shape/axial structures, cartilage, or muscles, also, optic nerves fail to reach or cross the midline. Accordingly, a “Smoothened” or a “Smoothened polypeptide” may be used as a marker for, or to prevent or treat, for example, a congenital disorder associated with mental retardation, such as microencephaly, anencephaly, Down's syndrome, Chiari Malformation, Colpocephaly, holoprosencephaly, lissencephaly, or megalencephaly, or a disorder affecting the cartilage or connective tissue, such as arthritis, juvenile rheumatoid arthritis, or Marfan's syndrome, or a disorder affecting muscle development, or a limb formation defect, such as achondroplasia, Ellis-van Creveld syndrome, Poland sequence, or Pfeiffer syndrome

[0254] By a “429 protein,” or a “429 polypeptide” is meant a polypeptide that has at least 53% amino acid sequence identity to the zebrafish 429 amino acid sequence of SEQ ID NO:48. Desirably, a “429 protein” or a “429 polypeptide” is at least 60%, 70%, 80%, 85%, 90%, 95%, 97%, 99%, or even 100% identical to the sequence of SEQ ID NO:48 over at least 500, 600, 700, or 750 contiguous amino acids. Polypeptides encoded by splice variants of 429 nucleic acid sequences, as well as by 429 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 182 and 183 of the 429 nucleic acid sequence of SEQ ID NO:47 are also included in this definition. A “429 protein” or a “429 polypeptide,” as referred to herein, plays a role in liver, gall bladder, pancreas, and gut development. The loss of a 429 polypeptide in a cell may result in the retarded development of these organs in three-day old mutant zebrafish. Accordingly, a “429 protein” or a “429 polypeptide” may be used as a marker for, or to prevent or treat, for example, a pancreatic disorder, such as diabetes.

[0255] By a “428 protein,” or a “428 polypeptide” is meant a polypeptide that has at least 62% amino acid sequence identity to the zebrafish 428 amino acid sequence of SEQ ID NO:50 over at least 170 amino acids. Desirably, a “428 protein” or a “428 polypeptide” is at least 62%, 70%, 80%, 85%, 90%, 95%, 97%, 99%, or even 100% identical to the sequence of SEQ ID NO:50 over at least 75, 100, 125, 150, 179 contiguous amino acids. Polypeptides encoded by splice variants of 428 nucleic acid sequences, as well as by 428 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 187 of the 428 nucleic acid sequence of SEQ ID NO:49 are also included in this definition. A “428 protein” or a “428 polypeptide,” as referred to herein, plays a role in muscle and brain development. The loss of a 428 polypeptide in a cell may result in defective muscles or brain necrosis. Accordingly, a “428 protein” or a “428 polypeptide” may be used as a marker for, or to prevent or treat, for example, a necrotizing brain disorder, such as Leigh's disease, or subacute necrotizing encephalomyelopathy, or a congenital brain defect that causes mental retardation, such as microencephaly, anencephaly, Down's syndrome, Chiari malformation, colpocephaly, holoprosencephaly, lissencephaly, or megalencephaly, or a muscle defect, such as myotonia congenital, inclusion body myositis, or a congenital muscular dystrophy.

[0256] By a “Spinster protein,” or a “Spinster polypeptide” is meant a polypeptide that has at least 64% amino acid sequence identity to the zebrafish Spinster amino acid sequence of SEQ ID NO:52 over at least 528 contiguous amino acids. Desirably, a “Spinster protein” or a “Spinster polypeptide” is at least 70%, 80%, 85%, 90%, 95%, 97%, 99%, or even 100% identical to the sequence of SEQ ID NO:52 over at least 300, 400, 450, or 507 contiguous amino acids. Polypeptides encoded by splice variants of Spinster nucleic acid sequences are also included in this definition. In addition, mutations Spinster nucleic acids that result in altered expression of a Spinster polypeptide, for example, a insertion of a virus several kb, for example, 2, 3, 4, 5, 8, 10, or even 15 kb upstream of the Spinster coding region (e.g., SEQ ID NO:51), are also included in this definition. A “Spinster protein” or a “Spinster polypeptide,” as referred to herein, plays a role in the maintainance of the yolk. In zebrafish, the loss of a Spinster polypeptide in a cell may result in the degeneration of the yolk by day two of development, resulting in the gradual death of the mutant zebrafish embryo. Accordingly, a “Spinster protein” or a “Spinster polypeptide” may be used as a marker for, or to prevent or treat, for example, a fertility defect.

[0257] By a “Glypican-6 or Knypek protein,” or a “Glypican-6 or Knypek polypeptide” is meant a polypeptide encoded by a Glypican-6 or Knypek gene, that has at least 58% amino acid sequence identity to the zebrafish Glypican-6 or Knypek amino acid sequence of SEQ ID NO:54 over a region spanning at least 550 amino acids. Desirably, a “Glypican-6 or Knypek protein” or a “Glypican-6 or Knypek polypeptide” is at least 60%, 70%, 80%, 85%, 90%, 95%, 97%, 99%, or even 100% identical to the sequence of SEQ ID NO:54 over at least 300, 350, 400, 450, 500, or 557 contiguous amino acids. Polypeptides encoded by splice variants of Glypican-6 or Knypek nucleic acid sequences, as well as by Glypican-6 or Knypek nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus at nucleotide 1054 or nucleotide 133 of a Glypican-6 or Knypek nucleic acid sequence (e.g., SEQ ID NO:53) are also included in this definition. A “Glypican-6 or Knypek protein” or a “Glypican-6 or Knypek polypeptide,” as referred to herein, plays a role in the tail and somite development. For example, in zebrafish, the loss of a Glypican-6 or Knypek polypeptide in a cell may result in a shortened tail or u-shaped somites. Accordingly, a “Glypican-6 or Knypek protein” or a “Glypican-6 or Knypek polypeptide” may be used as a marker for, or to prevent or treat, for example, a muscle defect such as myotonia congenital, inclusion body myositis, or a congenital muscular dystrophy.

[0258] By a “Ribonucleotide Reductase R1 Class 1 protein,” or a “Ribonucleotide Reductase R1 Class 1 polypeptide” is meant a polypeptide encoded by a Ribonucleotide Reductase R1 Class 1 nucleic acid sequence that is identical to the zebrafish Ribonucleotide Reductase R1 Class 1 amino acid sequence of SEQ ID NO:56. Polypeptides encoded by splice variants of Ribonucleotide Reductase R1 Class 1 nucleic acid sequences, as well as by nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 147 and 148 of the Ribonucleotide Reductase R1 Class I nucleic acid sequence (e.g., SEQ ID NO:55) are also included in this definition. A “Ribonucleotide Reductase R1 Class 1 protein” or a “Ribonucleotide Reductase R1 Class 1 polypeptide,” as referred to herein, plays multiple roles in development. The loss of a Ribonucleotide Reductase R1 Class 1 polypeptide in a cell may result in a bent convex body shape. In addition, zebrafish mutant for a Ribonucleotide Reductase R1 Class 1 nucleic acid sequence display transient brain and eye necrosis between 24 and 48 hours of development. Accordingly, a “Ribonucleotide Reductase R1 Class 1 protein” or a “Ribonucleotide Reductase R1 Class 1 polypeptide” may be used as a marker for, or to prevent or treat, for example, a necrotizing brain disorder, such as Leigh's disease, or subacute necrotizing encephalomyelopathy, or a degenerative eye disorder, such as retinitis pigmentosa or other retinal disorders, such as macular degeneration, Friedreich's ataxia, or Laurence-Moon syndrome.

[0259] By a “Kinesin-Related Motor Protein EG5,” or a “Kinesin-Related Motor EG5 polypeptide” is meant a polypeptide that has at least 55% amino acid sequence identity to the zebrafish Kinesin-Related Motor Protein EG5 amino acid sequence of SEQ ID NO:58 over a region spanning at least 948 contiguous amino acids. Desirably, a “Kinesin-Related Motor Protein EG5” or a “Kinesin-Related Motor EG5 polypeptide” is at least 60%, 70%, 80%, 85%, 90%, 95%, 97%, 99%, or even 100% identical to the sequence of SEQ ID NO:58 over at least 700, 750, 800, 850, 900, or 948 contiguous amino acids. Polypeptides encoded by splice variants of Kinesin-Related Motor EG5 nucleic acid sequences, as well as by Kinesin-Related Motor EG5 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 50 and 51 of a Kinesin-Related Motor EG5 nucleic acid sequence (e.g., SEQ ID NO:57) are also included in this definition. A “Kinesin-Related Motor protein EG5” or a “Kinesin-related motor EG5 polypeptide,” as referred to herein, plays a role in cell death during development. In zebrafish, the loss of a Kinesin-Related Motor EG5 polypeptide in a cell results in a bent body shape. In addition, these zebrafish mutants display elevated levels of apoptotic cells on their surface by 48 hours of development. Accordingly, a “Kinesin-Related Motor Protein EG5” or a “Kinesin-Related Motor EG5 polypeptide” may be used as a marker for, or to prevent or treat, a neurodegenerative disease characterized by excess cell death, for example, Alzheimer's disease, Parkinson's disease, or spinocerebellar ataxia, or to treat a proliferative disorder, such as cancer.

[0260] By a “Wnt5 or Pipetail protein,” or a “Wnt5 or Pipetail polypeptide” is meant a polypeptide that is identical to a zebrafish Pipetail amino acid sequence, for example, that of SEQ ID NO:62. Polypeptides encoded by splice variants of Wnt5 or Pipetail nucleic acid sequences (e.g., SEQ ID NO:61), as well as by Wnt5 or Pipetail nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus at nucleotide 397, or between nucleotides 530 and 531 of the Wnt5 or Pipetail nucleic acid sequence of SEQ ID NO:61 nucleic acid sequence are also included in this definition. A “Wnt5 or Pipetail protein” or a “Wnt5 or Pipetail polypeptide,” as referred to herein, plays a role in cell death during development. For example, the loss of a Wnt5 or Pipetail polypeptide in a cell may result in a variably truncated tail in zebrafish. Accordingly, a “Wnt5 or Pipetail protein” or a “Wnt5 or Pipetail polypeptide” may be used as a marker for, or to prevent or treat, for example, a neurodegenerative disease characterized by excess cell death, for example, Alzheimer's disease, Parkinson's disease, or spinocerebellar ataxia.

[0261] By an “Aryl Hydrocarbon Receptor Nuclear Translocator 2A protein,” or an “Aryl Hydrocarbon Receptor Nuclear Translocator 2A polypeptide” is meant a polypeptide that, desirably, is identical to the sequence of SEQ ID NO:64. However, polypeptides encoded by splice variants of Aryl Hydrocarbon Receptor Nuclear Translocator 2A nucleic acid sequences, as well as by Aryl Hydrocarbon Receptor Nuclear Translocator 2A nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus at nucleotide 229 or 240 of the Aryl Hydrocarbon Receptor Nuclear Translocator 2A nucleic acid sequence of SEQ ID NO:63 are also included in this definition. An “Aryl Hydrocarbon Receptor Nuclear Translocator 2A protein” or an “Aryl Hydrocarbon Receptor Nuclear Translocator 2A polypeptide,” as referred to herein, plays a role in cell death during development. For example, the loss of an Aryl Hydrocarbon Receptor Nuclear Translocator 2A polypeptide in a cell may result in little motility and a weak tap response in zebrafish. Accordingly, an “Aryl Hydrocarbon Receptor Nuclear Translocator 2A protein” or an “Aryl Hydrocarbon Receptor Nuclear Translocator 2A polypeptide” may be used as a marker for, or to prevent or treat, for example, a neurodegenerative disease, for example, Alzheimer's disease, Parkinson's disease, or spinocerebellar ataxia, or a movement disorder, such as Angelman's syndrome, tardive dyskinesia, or spasticity.

[0262] By a “Vesicular Integral-Membrane Protein VIP 36 Protein,” or a “Vesicular Integral-Membrane Protein VIP 36 polypeptide” is meant a polypeptide that has at least 61% amino acid sequence identity to the zebrafish Vesicular Integral-Membrane Protein VIP 36 amino acid sequence of SEQ ID NO:66 over a region spanning at least 340 contiguous amino acids. Desirably, a “Vesicular Integral-Membrane Protein VIP 36 protein” or a “Vesicular Integral-Membrane Protein VIP 36 polypeptide” is at least 61%, 70%, 80%, 90%, or 95% identical to the sequence of SEQ ID NO:66 over at least 250, 275, 300, o4 340 contiguous amino acids. In a more desirable embodiment, a “Vesicular Integral-Membrane Protein VIP 36 Protein,” or a “Vesicular Integral-Membrane Protein VIP 36 polypeptide” is identical to the sequence of SEQ ID NO:66. Polypeptides encoded by splice variants of Vesicular Integral-Membrane Protein VIP 36 nucleic acid sequences, as well as by Vesicular Integral-Membrane Protein VIP 36 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 219 and 220 of the Vesicular Integral-Membrane Protein VIP 36 nucleic acid sequence of SEQ ID NO:65 are also included in this definition. A “Vesicular Integral-Membrane Protein VIP 36 protein” or a “Vesicular Integral-Membrane Protein VIP 36 polypeptide,” as referred to herein, plays a role touch sensitivity. For example, the loss of a Vesicular Integral-Membrane Protein VIP 36 polypeptide in a cell may result in touch insensitivity in zebrafish mutants at day five of development. Accordingly, a “Vesicular Integral-Membrane Protein VIP 36 protein” or a “Vesicular Integral-Membrane Protein VIP 36 polypeptide” may be used as a marker for, or to prevent or treat, for example, a neurodegenerative disease, such as Alzheimer's disease, Parkinson's disease, or spinocerebellar ataxia.

[0263] By a “299 Protein,” or a “299 polypeptide” is meant a polypeptide that has at least 44% amino acid sequence identity to the zebrafish 299 amino acid sequence of SEQ ID NO:68 over a region spanning 563 amino acids. Desirably, a “299” protein or a “299 polypeptide” is at least 44%, 50%, 60%, 70%, 80%, 90%, 95%, 99%, or even 100% identical to the sequence of SEQ ID NO:68. Polypeptides encoded by splice variants of 299 nucleic acid sequences, as well as by 299 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 47 and 48 of the 299 nucleic acid sequence of SEQ ID NO:67 are also included in this definition. A “299 protein” or a “299 polypeptide,” as referred to herein, plays a role in cell death. For example, in zebrafish, the loss of a 299 polypeptide in a cell results in mutants that fail to develop a jaw, branchial arches, and normal size fins by day four of development and that have apoptosis in the eye and brain. Accordingly, a “299 protein” or a “299 polypeptide” may be used as a marker for, or to prevent or treat, for example, a neurodegenerative disease characterized by excess cell death, such as Alzheimer's disease, Parkinson's disease, or spinocerebellar ataxia, or a craniofacial defect, such as Apert, Crouzon, Pfeiffer and Saethre-Chotzen Syndromes, or a connective tissue disease, such as arthritis or rheumatoid arthritis, or may be used to treat a proliferative disorder, such as cancer.

[0264] By a “994 Protein,” or a “994 polypeptide” is meant a polypeptide that has at least 35% amino acid sequence identity to the zebrafish 994 amino acid sequence of SEQ ID NO:70 over a region spanning at least 490 contiguous amino acids. Desirably, a “994 protein” or a “994 polypeptide” is at least 35%, 40%, 50%, 60%, 70%, 80%, 90%, or 95% identical to the sequence of SEQ ID NO:70 over at least 300, 350, 400, 450, or 490 contiguous amino acids. In a more desirable embodiment, a “994 Protein,” or a “994 polypeptide” is identical to the sequence of SEQ ID NO:70. Polypeptides encoded by splice variants of 994 nucleic acid sequences, as well as by 994 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 66 and 67 of a 994 nucleic acid sequence (e.g., SEQ ID NO:69) are also included in this definition. The coding region of the 994 gene may begin at nucleotide 5 or 80 of SEQ ID NO 69. In addition, a “994 protein” or a “994 polypeptide,” as referred to herein, plays a role in head and eye development. For example, in zebrafish, the loss of a 994 polypeptide in a cell results in mutants that have defects in jaw, branchial arch, fin development, an under developed stomach, and small heads and eyes. Accordingly, a “994 protein” or a “994 polypeptide” may be used as a marker for, or to prevent or treat, for example, a craniofacial disorder, such as Apert, Crouzon, Pfeiffer or Saethre-Chotzen Syndromes, or an eye malformation syndrome, such as oculorenal syndrome, or Reiger syndrome, or a limb formation defect, such as achondroplasia, Ellis-van Creveld syndrome, Poland sequence, or Pfeiffer syndrome, or a congenital brain defect that causes mental retardation, such as microencephaly, anencephaly, Down's syndrome, Chiari Malformation, Colpocephaly, holoprosencephaly, lissencephaly, or megalencephaly.

[0265] By a “1373 protein,” or a “1373 polypeptide” is meant a polypeptide encoded by a 1373 nucleic acid sequence that has at least 91% amino acid sequence identity to the zebrafish 1373 amino acid sequence of SEQ ID NO:72 over a region spanning at least 110 contiguous amino acids. Desirably, a “1373 protein” or a “1373 polypeptide” is at least 91% or 95% identical to the sequence of SEQ ID NO:72 over at least 75, 100, or 110 contiguous amino acids. In a more desirable embodiment, a “1373 protein,” or a “1373 polypeptide” is identical to the sequence of SEQ ID NO:72. Polypeptides encoded by splice variants of 1373 nucleic acid sequences, as well as by 1373 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 118 and 119 of the 1373 nucleic acid sequence of SEQ ID NO:71 are also included in this definition. A “1373 protein” or a “1373 polypeptide,” as referred to herein, plays a role in development. For example, in zebrafish, the loss of a 1373 polypeptide in a cell results in mutants that display brain and eye necrosis, constriction of the anterior end of the yolk sac extension, and body curvature by day two of development. Accordingly, a “1373 protein” or a “1373 polypeptide” may be used as a marker for, or to prevent or treat, for example, a necrotizing brain disorder, Leigh's disease, or subacute necrotizing encephalomyelopathy, or a degenerative eye disorder, such as such as retinitis pigmentosa, macular degeneration, Friedreich's ataxia, or Laurence-Moon syndrome.

[0266] By a “Denticleless Protein,” or a “Denticleless polypeptide” is meant a polypeptide that has at least 46% amino acid sequence identity to the zebrafish Denticleless amino acid sequence of SEQ ID NO:74 over a region spanning at least 729 contiguous amino acids. Desirably, a “Denticleless protein” or a “denticleless polypeptide” is at least 46%, 50%, 60%, 70%, 80%, 90%, or 95% identical to the sequence of SEQ ID NO:74 over at least 500, 559, 600, 650, 700 or 729 contiguous amino acids. In a more desirable embodiment, a “Denticleless Protein,” or a “Denticleless polypeptide” is identical to the sequence of SEQ ID NO:74. Polypeptides encoded by splice variants of Denticleless nucleic acid sequences, as well as by Denticleless nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 307 and 308 of the Denticleless nucleic acid sequence of SEQ ID NO:73 are also included in this definition. A “Denticleless protein” or a “Denticleless polypeptide,” as referred to herein, plays a role in development. For example, in zebrafish, the loss of a Denticleless polypeptide in a cell results in mutants that display brain necrosis extending down the neural tube and a wrinkled yolk sac on day one of development. In addition, body curvature, wrinkled somites, irregular eye shape and the absence of the yolk sac extension are observed by day two of development. Accordingly, a “Denticleless protein” or a “Denticleless polypeptide” may be used as a marker for, or to prevent or treat, for example, a craniofacial disorder, such as Angelman's disease, or an eye malformation syndrome, for example, oculorenal syndrome, or Reiger syndrome, or a necrotizing brain disorder, such as Leigh's disease, or subacute necrotizing encephalomyelopathy.

[0267] By a “Ribonucleotide Reductase Protein R2 Protein,” or a “Ribonucleotide Reductase Protein R2 polypeptide” is meant, for example, a polypeptide that is identical to the sequence of SEQ ID NO:76. However, polypeptides encoded by splice variants of Ribonucleotide Reductase Protein R2 nucleic acid sequences, as well as by Ribonucleotide Reductase Protein R2 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus at nucleotide 137 (which corresponds to position 360 of an alternatively splice form of this gene (GenBank Accession No. AW280665)) of a Ribonucleotide Reductase Protein R2 nucleic acid sequence (e.g., SEQ ID NO:75), and virus insertions at nucleotide 337 or 342 of GenBank Accession No. AW280665, are also included in this definition. A “Ribonucleotide Reductase Protein R2 protein” or a “Ribonucleotide Reductase Protein R2 polypeptide,” as referred to herein, plays a role in development. For example, in zebrafish, the loss of a Ribonucleotide Reductase Protein R2 polypeptide in a cell results in mutants that display CNS necrosis and the entire body curls up by day two of development. Accordingly, a “Ribonucleotide Reductase Protein R2 protein” or a “Ribonucleotide Reductase Protein R2 polypeptide” may be used as a marker for, or to prevent or treat, for example, a necrotizing brain disorder, such as Leigh's disease, or subacute necrotizing encephalomyelopathy.

[0268] By a “TCP-1 Alpha protein,” or a “TCP-1 Alpha polypeptide” is meant a polypeptide that, desirably, is identical to the zebrafish TCP-1 Alpha amino acid sequence of SEQ ID NO:78. Polypeptides encoded by splice variants of TCP-1 Alpha nucleic acid sequences, as well as by TCP-1 Alpha nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 130 and 131, or 140 bp upstream of, a TCP-1 Alpha nucleic acid sequence (e.g., SEQ ID NO:77) are also included in this definition. A “TCP-1 Alpha protein” or a “TCP-1 Alpha polypeptide,” as referred to herein, plays a role in development. For example, in zebrafish, the loss of a TCP-1 Alpha polypeptide in a cell results in mutants that display small brains and eyes. Accordingly, a “TCP-1 Alpha protein” or a “TCP-1 Alpha polypeptide” may be used as a marker for, or to prevent or treat, for example, a congenital brain defect that causes mental retardation, such as anencephaly, Down's syndrome, Chiari Malformation, Colpocephaly, holoprosencephaly, lissencephaly, or megalencephaly, or a disorder associated with microcephaly, such as Angelman's syndrome or cri du chat, or an eye malformation syndrome, such as oculorenal syndrome or Reiger syndrome.

[0269] By a “Telomeric Repeat Factor 2 Protein,” or a “Telomeric Repeat Factor 2 polypeptide” is meant a polypeptide encoded by a Telomeric Repeat Factor 2 gene that has at least 32% amino acid sequence identity to the zebrafish Telomeric Repeat Factor 2 amino acids sequence of SEQ ID NO:80 over a region spanning 200 amino acids. Desirably, a “Telomeric Repeat Factor 2 protein” or a “Telomeric Repeat Factor 2 polypeptide” is at least 40%, 50%, 60%, 70%, 80%, 90%, or 95% identical to the sequence of SEQ ID NO:80 over at least 75, 100, 125, 150, 175, or 200 contiguous amino acids. In a more desirable embodiment, a “Telomeric Repeat Factor 2 Protein,” or a “Telomeric Repeat Factor 2 polypeptide” is identical to the sequence of SEQ ID NO:80. Polypeptides encoded by splice variants of Telomeric Repeat Factor 2 nucleic acid sequences, as well as by Telomeric Repeat Factor 2 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 529 and 530 of the Telomeric Repeat Factor 2 nucleic acid sequence of SEQ ID NO:79 are also included in this definition. A “Telomeric Repeat Factor 2 protein” or a “Telomeric Repeat Factor 2 polypeptide,” as referred to herein, plays a role in brain and eye development. For example, in zebrafish, the loss of a Telomeric Repeat Factor 2 polypeptide in a cell results in mutants that display brain and eye necrosis by day two of development. Accordingly, a “Telomeric Repeat Factor 2 protein” or a “Telomeric Repeat Factor 2 polypeptide” may be used as a marker for, or to prevent or treat, for example, a necrotizing brain disorder, such as Leigh's disease, or subacute necrotizing encephalomyelopathy, or a degenerative eye disease, such as retinitis pigmentosa, macular degeneration, Friedreich's ataxia, or Laurence-Moon syndrome.

[0270] By a “SIL Protein,” or a “SIL polypeptide” is meant a polypeptide that has at least 36% amino acid sequence identity to the zebrafish SIL amino acid sequence of SEQ ID NO:82 over a region spanning 1363 amino acids. Desirably, a “SIL protein” or a “SIL polypeptide” is at least 36%, 50%, 60%, 70%, 80%, or 90% identical to the sequence of SEQ ID NO:82 over at least 700, 800, 900, 1000, 1100, 1200, 1300, or 1363 contiguous amino acids. In a more desirable embodiment, a “SIL Protein,” or a “SIL polypeptide” is identical to the sequence of SEQ ID NO:82. Polypeptides encoded by splice variants of SIL nucleic acid sequences, as well as by SIL nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 273 and 274 of the SIL nucleic acid sequence of SEQ ID NO:81 are also included in this definition. A “SIL protein” or a “SIL polypeptide,” as referred to herein, plays a role in development of the head and body. For example, in zebrafish, the loss of a SIL polypeptide in a cell results in mutants that have a head that is 33% smaller than wild-type by day two of development. In addition, these mutants exhibit brain necrosis, a bent body, and motility defects. Accordingly, a “SIL protein” or a “SIL polypeptide” may be used as a marker for, or to prevent or treat, for example, a necrotizing brain disorder, for example, Leigh's disease, or subacute necrotizing encephalomyelopathy, a congenital brain defect that causes mental retardation, such as anencephaly, Down's syndrome, Chiari Malformation, Colpocephaly, holoprosencephaly, lissencephaly, or megalencephaly, or a disorder associated with microcephaly, such as Angelman's syndrome or cri du chat, or an eye malformation syndrome, such as oculorenal syndrome or Reiger syndrome, or a degenerative eye disease, such as retinitis pigmentosa or other retinal disorders, such as macular degeneration, Friedrich ataxia, or Laurence-Moon syndrome.

[0271] By a “U1 Small Nuclear Ribonucleoprotein C Protein,” or a “U1 Small Nuclear Ribonucleoprotein C polypeptide” is meant a polypeptide that has at least 84% amino acid sequence identity to the zebrafish U1 Small Nuclear Ribonucleoprotein C amino acid sequence of SEQ ID NO:84 over a region spanning at least 159 contiguous amino acids. Desirably, a “U1 Small Nuclear Ribonucleoprotein C protein” or a “U1 Small Nuclear Ribonucleoprotein C polypeptide” is at least 90%, 95%, or 98% identical to the sequence of SEQ ID NO:84 over at least 75, 100, 125, or 159 contiguous amino acids. In a more desirable embodiment, a “U1 Small Nuclear Ribonucleoprotein C Protein,” or a “U1 Small Nuclear Ribonucleoprotein C polypeptide” is identical to the sequence of SEQ ID NO:84. Polypeptides encoded by splice variants of U1 Small Nuclear Ribonucleoprotein C nucleic acid sequences, as well as by U1 Small Nuclear Ribonucleoprotein C nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 52 and 53 of the U1 Small Nuclear Ribonucleoprotein C nucleic acid sequence of SEQ ID NO:83 are also included in this definition. A “U1 Small Nuclear Ribonucleoprotein C protein” or a “U1 Small Nuclear Ribonucleoprotein C polypeptide,” as referred to herein, plays a role in development of the brain, eyes and ears. For example, in zebrafish, the loss of a U1 Small Nuclear Ribonucleoprotein C polypeptide in a cell results in mutants that display motility defects, a body that curves upward, brain necrosis, smaller eyes and otoliths than wild-type zebrafish, pigment in the hind-brain, and retarded fin development. Accordingly, a “U1 Small Nuclear Ribonucleoprotein C protein” or a “U1 Small Nuclear Ribonucleoprotein C polypeptide” may be used as a marker for, or to prevent or treat, for example, a sensory disorder, such as Usher syndrome, Waardenburg syndrome, or other disorders characterized by congenital deafness or blindness, or an eye malformation syndrome, such as oculorenal syndrome or Reiger syndrome, or a necrotizing brain disorder, for example, Leigh's disease, or subacute necrotizing encephalomyelopathy, or a movement disorder, such as Angelman's syndrome, tardive dyskinesia, spasticity, ataxias, or spinocerebellar ataxia.

[0272] By a “Ski Interacting Protein,” or a “Ski Interacting polypeptide” is meant a polypeptide that has at least 84% amino acid sequence identity to the zebrafish Ski Interacting Protein amino acid sequence of SEQ ID NO:86 over a region spanning at least 536 contiguous amino acids. Desirably, a “Ski Interacting Protein protein” or a “Ski Interacting Protein polypeptide” is at least 90%, 95%, or 98% identical to the sequence of SEQ ID NO:86 over at least 300, 350, 400, 450, 500, or 536 contiguous amino acids. In a more desirable embodiment, a “Ski Interacting Protein,” or a “Ski Interacting polypeptide” is identical to the sequence of SEQ ID NO:86. Polypeptides encoded by splice variants of Ski Interacting Protein nucleic acid sequences, as well as by Ski Interacting Protein nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus approximately 1.2 kb upstream from the beginning of the Ski Interacting Protein nucleic acid sequence of SEQ ID NO:85 are also included in this definition. A “Ski Interacting Protein” or a “Ski Interacting Protein polypeptide,” as referred to herein, plays a role in development of the brain and body. For example, in zebrafish, the loss of a Ski Interacting Protein polypeptide in a cell results in mutants that display extensive brain necrosis, a lack of brain divisions, a curved body, and abnormal motility by day two of development. Accordingly, a “Ski Interacting Protein protein” or a “Ski Interacting Protein polypeptide” may be used as a marker for, or to prevent or treat, for example, a necrotizing brain disorder, such as Leigh's disease, or subacute necrotizing encephalomyelopathy, or a congenital brain defect that causes mental retardation, such as microencephaly, anencephaly, Down's syndrome, Chiari Malformation, Colpocephaly, holoprosencephaly, lissencephaly, or megalencephaly, or a movement disorder such as Angelman's syndrome, tardive dyskinesia, spasticity, ataxias, or spinocerebellar ataxia.

[0273] By a “297 protein,” or a “297 polypeptide” is meant a polypeptide that has at least 77% amino acid sequence identity to the zebrafish 297 amino acid sequence of SEQ ID NO:88 over a region spanning at least 624 contiguous amino acids. Desirably, a “297 protein” or a “297 polypeptide” is at least 77%, 85%, 90%, or 95% identical to the sequence of SEQ ID NO:88 over at least 400, 500, 550, 600, or 624 contiguous amino acids. In a more desirable embodiment, “297 protein,” or a “297 polypeptide” is identical to the sequence of SEQ ID NO:88. Polypeptides encoded by splice variants of 297 nucleic acid sequences, as well as by 297 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus at nucleotide 74 of the 297 nucleic acid sequence of SEQ ID NO:87 are also included in this definition. A “297 protein” or a “297 polypeptide,” as referred to herein, plays a role in development of the brain, tail, cartilage, ethmoid plate, and the jaw. For example, in zebrafish, the loss of a 297 polypeptide in a cell results in mutants that display brain necrosis, and a kinked tail. Accordingly, a “297 protein” or a “297 polypeptide” may be used as a marker for, or to prevent or treat, for example, a necrotizing brain disorder, for example, Leigh's disease, or subacute necrotizing encephalomyelopathy, or a congenital brain defect that causes mental retardation, such as microencephaly, anencephaly, Down's syndrome, Chiari Malformation, Colpocephaly, holoprosencephaly, lissencephaly, or megalencephaly, or a craniofacial defect, such as Apert, Crouzon, Pfeiffer or Saethre-Chotzen Syndromes.

[0274] By a “TCP-1 Complex Gamma Chain protein,” or a “TCP-1 Complex Gamma Chain polypeptide” is meant a polypeptide that has at least 87% amino acid sequence identity to the zebrafish TCP-1 Complex Gamma Chain amino acid sequence of SEQ ID NO:90 over a region spanning at least 541 contiguous amino acids. Desirably, a “TCP-1 Complex Gamma Chain protein” or a “TCP-1 Complex Gamma Chain polypeptide” is at least 90%, 95%, or 98% identical to the sequence of SEQ ID NO:90 over at least 300, 350, 400, 450, 500, or 541 contiguous amino acids. In a more desirable embodiment, a “TCP-1 Complex Gamma Chain protein,” or a “TCP-1 Complex Gamma Chain polypeptide” is identical to the sequence of SEQ ID NO:90. Polypeptides encoded by splice variants of TCP-1 Complex Gamma Chain nucleic acid sequences, as well as by TCP-1 Complex Gamma Chain nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 75 and 76 of the TCP-1 Complex Gamma Chain nucleic acid sequence of SEQ ID NO:89 are also included in this definition. A “TCP-1 Complex Gamma Chain protein” or a “TCP-1 Complex Gamma Chain polypeptide,” as referred to herein, plays a role in development of the yolk sac. For example, in zebrafish, the loss of a TCP-1 Complex Gamma Chain polypeptide in a cell results in mutants that display a thinner yolk sac extension than wild-type.

[0275] By a “Small Nuclear Ribonucleoprotein D1 protein,” or a “Small Nuclear Ribonucleoprotein D1 polypeptide” is meant a polypeptide that has at least 97% amino acid sequence identity to the zebrafish Small Nuclear Ribonucleoprotein D1 amino acid sequence of SEQ ID NO:92 over at least 119 contiguous amino acids. Desirably, a “Small Nuclear Ribonucleoprotein D1 protein” or a “Small Nuclear Ribonucleoprotein D1 polypeptide” is at least 97%, 98%, 99%, or even 100% identical to the sequence of SEQ ID NO:92 over at least 75, 100, or 119 contiguous amino acids. Polypeptides encoded by splice variants of Small Nuclear Ribonucleoprotein D1 nucleic acid sequences, as well as by Small Nuclear Ribonucleoprotein D1 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 76 and 77 of the Small Nuclear Ribonucleoprotein D1 nucleic acid sequence of SEQ ID NO:91 are also included in this definition. A “Small Nuclear Ribonucleoprotein D1 protein” or a “Small Nuclear Ribonucleoprotein D1 polypeptide,” as referred to herein, plays a role in development of the CNS. For example, in zebrafish, the loss of a Small Nuclear Ribonucleoprotein D1 polypeptide in a cell results in mutants that display an inflated hind-brain and increased necrosis in the CNS, particularly in the eye. Accordingly, a “Small Nuclear Ribonucleoprotein D1 protein” or a “Small Nuclear Ribonucleoprotein D1 polypeptide” may be used as a marker for, or to prevent or treat, for example, a necrotizing brain disorder, such as Leigh's disease, or subacute necrotizing encephalomyelopathy, or a degenerative eye disease, such as retinitis pigmentosa, macular degeneration, Friedreich's ataxia, or Laurence-Moon syndrome, or a congenital brain defect that causes mental retardation, such as microencephaly, anencephaly, Down's syndrome, Chiari Malformation, Colpocephaly, holoprosencephaly, lissencephaly, or megalencephaly.

[0276] By a “DNA Polymerase Epsilon Subunit B protein,” or a “DNA Polymerase Epsilon Subunit B polypeptide” is meant a polypeptide that has at least 74% amino acid sequence identity to the zebrafish DNA Polymerase Epsilon Subunit B amino acid sequence of SEQ ID NO:94 over at least 527 contiguous amino acids. Desirably, a “DNA Polymerase Epsilon Subunit B protein” or a “DNA Polymerase Epsilon Subunit B polypeptide” is at least 80%, 90%, 95% or 98% identical to the sequence of SEQ ID NO:94 over at least 300, 350, 400, 500, or 527 contiguous amino acids. In a more desirable embodiment, a “DNA Polymerase Epsilon Subunit B protein,” or a “DNA Polymerase Epsilon Subunit B polypeptide” is identical to the sequence of SEQ ID NO:94. Polypeptides encoded by splice variants of DNA Polymerase Epsilon Subunit B nucleic acid sequences, as well as by DNA Polymerase Epsilon Subunit B nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 1161 and 1162, or at nucleotide 929 of the DNA Polymerase Epsilon Subunit B nucleic acid sequence of SEQ ID NO:93 are also included in this definition. A “DNA Polymerase Epsilon Subunit B protein” or a “DNA Polymerase Epsilon Subunit B polypeptide,” as referred to herein, plays a role in development of the CNS. For example, in zebrafish, the loss of a DNA Polymerase Epsilon Subunit B polypeptide in a cell results in mutants that display increased necrosis in the brain and eye. Accordingly, a “DNA Polymerase Epsilon Subunit B protein” or a “DNA Polymerase Epsilon Subunit B polypeptide” may be used as a marker for, or to prevent or treat, for example, a necrotizing brain disorder, such as Leigh's disease, or subacute necrotizing encephalomyelopathy, or a degenerative eye disease, such as retinitis pigmentosa, macular degeneration, Friedreich's ataxia, or Laurence-Moon syndrome.

[0277] By an “821-02 protein,” or an “821-02 polypeptide” is meant a polypeptide that has at least 52% amino acid sequence identity to the zebrafish 821-02 amino acid sequence of SEQ ID NO:96 over at least 683 contiguous amino acids. Desirably, an “821-02 protein” or an “821-02 polypeptide” is at least 52%, 60%, 70%, 80%, 90%, 95%, or 98% identical to the sequence of SEQ ID NO:96 over at least 400, 450, 500, 550, 600, 650, or 683 contiguous amino acids. In a more desirable embodiment, an “821-02 protein,” or an “821-02 polypeptide” is identical to the sequence of SEQ ID NO:96. Polypeptides encoded by splice variants of 821-02 nucleic acid sequences, as well as by 821-02 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 231 and 232, or between nucleotides 369 and 370, of the 821-02 nucleic acid sequence of SEQ ID NO:95 are also included in this definition. An “821-02 protein” or an “821-02 polypeptide,” as referred to herein, plays a role in cell death. For example, in zebrafish, the loss of an 821-02 polypeptide in a cell results in mutants that display extensive apoptosis in the CNS and the eye by 24 to 48 hours of development as visualized by acridine orange staining. Accordingly, an “821-02 protein” or an “821-02 polypeptide” may be used as a marker for, or to prevent or treat, a neurodegenerative disease characterized by excess cell death, for example, Alzheimer's disease, Parkinson's disease, or spinocerebellar ataxia, or a degenerative eye disease, such as retinitis pigmentosa, macular degeneration, Friedreich's ataxia, or Laurence-Moon syndrome.

[0278] By a “1045 protein,” or a “1045 polypeptide” is meant a polypeptide that has at least 76% amino acid sequence identity to the zebrafish 1045 amino acid sequence of SEQ ID NO:98 over a region that spans at least 265 contiguous amino acids. Desirably, a “1045 protein” or a “1045 polypeptide” is at least 80%, 90%, or 95% identical to the sequence of SEQ ID NO:98 over at least 150, 175, 200, 225, or 265 contiguous amino acids. In a more desirable embodiment, a “1045 protein,” or a “1045 polypeptide” is identical to the sequence of SEQ ID NO:98. Polypeptides encoded by splice variants of 1045 nucleic acid sequences, as well as by 1045 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 216 and 344 of the 1045 nucleic acid sequence of SEQ ID NO:97 are also included in this definition. A “1045 protein” or a “1045 polypeptide,” as referred to herein, plays a role in brain development. For example, in zebrafish, the loss of a 1045 polypeptide in a cell results in mutants that display severe brain and head necrosis at 24 hours of development. Accordingly, a “1045 protein” or a “1045 polypeptide” may be used as a marker for, or to prevent or treat, for example, a necrotizing brain disorder, for example, Leigh's disease, or subacute necrotizing encephalomyelopathy.

[0279] By a “1055-1 protein,” or a “1055-1 polypeptide” is meant a polypeptide that has at least 70% amino acid sequence identity to the zebrafish 1055-1 amino acid sequence of SEQ ID NO:100 over a region that spans at least 285 contiguous amino acids. Desirably, a “1055-1 protein” or a “1055-1 polypeptide” is at least 70%, 75%, 80%, 90%, or 95% identical to the sequence of SEQ ID NO:100 over at least 150, 175, 200, 225, 250, or 285 contiguous amino acids. In a more desirable embodiment, sa “1055-1 protein,” or a “1055-1 polypeptide” is identical to the sequence of SEQ ID NO:100. Polypeptides encoded by splice variants of 1055-1 nucleic acid sequences, as well as by 1055-1 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 167 and 168 of the 1055-1 nucleic acid sequence of SEQ ID NO:99 are also included in this definition. A “1055-1 protein” or a “1055-1 polypeptide,” as referred to herein, plays a role in yolk sac development. For example, in zebrafish, the loss of a 1055-1 polypeptide in a cell results in mutants that display a misshapen or missing yolk sac extension and a tail that bends down. A “1055-1 protein” or a “1055-1 polypeptide” may be used as a marker for, or to prevent or treat, a proliferative disorder.

[0280] By a “Spliceosome Associated Protein 49 protein,” or a “Spliceosome Associated Protein 49 polypeptide” is meant a polypeptide that has at least 80% amino acid sequence identity to the zebrafish Spliceosome Associated Protein 49 amino acid sequence of SEQ ID NO:102 over a region that spans at least 322 contiguous amino acids. Desirably, a “Spliceosome Associated Protein 49 protein” or a “Spliceosome Associated Protein 49 polypeptide” is at least 85%, 90%, 95% or 98% identical to the sequence of SEQ ID NO:102 over at least 200, 225, 250, or 300, or 322 contiguous amino acids. In a more desirable embodiment, a “Spliceosome Associated Protein 49 protein,” or a “Spliceosome Associated Protein 49 polypentide” is identical to the sequence of SEQ ID NO:102. Polypeptides encoded by splice variants of Spliceosome Associated Protein 49 nucleic acid sequences, as well as by Spliceosome Associated Protein 49 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 53 and 54 of the Spliceosome Associated Protein 49 nucleic acid sequence of SEQ ID NO:101 are also included in this definition. A “Spliceosome Associated Protein 49 protein” or a “Spliceosome Associated Protein 49 polypeptide,” as referred to herein, plays a role in brain development. For example, in zebrafish, the loss of a Spliceosome Associated Protein 49 polypeptide in a cell results in mutants that display tectal necrosis and a bent body by day two of development. Accordingly, a “Spliceosome Associated Protein 49 protein” or a “Spliceosome Associated Protein 49 polypeptide” may be used as a marker for, or to prevent or treat, for example, a necrotizing brain disorder, such as Leigh's disease, or subacute necrotizing encephalomyelopathy.

[0281] By a “DNA Replication Licensing Factor MCM7 protein,” or a “DNA Replication Licensing Factor MCM7 polypeptide” is meant a polypeptide that has at least 75% amino acid sequence identity to the zebrafish DNA Replication Licensing Factor MCM7 amino acid sequence of SEQ ID NO:104 over at least 175, 100, 125, 150, 175, or 194 contiguous amino acids. Desirably, a “DNA Replication Licensing Factor MCM7 protein” or a “DNA Replication Licensing Factor MCM7 polypeptide” is at least 75%, 80%, 90%, or 95% identical to the sequence of SEQ ID NO:104. In a more desirable embodiment, a “DNA Replication Licensing Factor MCM7 protein” or a “DNA Replication Licensing Factor MCM7 polypeptide” is identical to the sequence of SEQ ID NO: 104. Polypeptides encoded by splice variants of DNA Replication Licensing Factor MCM7 nucleic acid sequences, as well as by DNA Replication Licensing Factor MCM7 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 121 and 122, or at nucleotide 198, of the DNA Replication Licensing Factor MCM7 nucleic acid sequence of SEQ ID NO:103 are also included in this definition. A “DNA Replication Licensing Factor MCM7 protein” or a “DNA Replication Licensing Factor MCM7 polypeptide,” as referred to herein, plays a role in eye and CNS development. For example, in zebrafish, the loss of a DNA Replication Licensing Factor MCM7 polypeptide in a cell results in mutants that display severe eye and CNS necrosis during late day one, or early in day two of development. Accordingly, a “DNA Replication Licensing Factor MCM7 protein” or a “DNA Replication Licensing Factor MCM7 polypeptide” may be used as a marker for, or to prevent or treat, for example, a necrotizing brain disorder, such as Leigh's disease, or subacute necrotizing encephalomyelopathy disease, or a degenerative eye disease, such as retinitis pigmentosa, macular degeneration, Friedreich's ataxia, or Laurence-Moon syndrome.

[0282] By a “DEAD-Box RNA Helicase (DEAD5 or DEAD19) protein,” or a “DEAD-Box RNA Helicase (DEAD5 or DEAD19) polypeptide” is meant a polypeptide that has at least 84% amino acid sequence identity to the zebrafish DEAD-Box RNA Helicase (DEAD5 or DEAD19) amino acid sequence of SEQ ID NO:106 over a region that spans at least 487 contiguous amino acids. Desirably, a “DEAD-Box RNA Helicase (DEAD5 or DEAD19) protein” or a “DEAD-Box RNA Helicase (DEAD5 or DEAD19) polypeptide” is at least 84%, 90%, 95%, or 98% identical to the sequence of SEQ ID NO:106 over at least 300, 350, 400, 450, or 457 contiguous amino acids. In a more desirable embodiment, a “DEAD-Box RNA Helicase (DEAD5 or DEAD19) protein,” or a “DEAD-Box RNA Helicase (DEAD5 or DEAD19) polypeptide” is identical to the sequence of SEQ ID NO:106. Polypeptides encoded by splice variants of DEAD-Box RNA Helicase (DEAD5 or DEAD19) nucleic acid sequences, as well as by DEAD-Box RNA Helicase (DEAD5 or DEAD19) nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus at nucleotide 132 of the DEAD-Box RNA Helicase (DEAD5 or DEAD19) nucleic acid sequence of SEQ ID NO:105 are also included in this definition. A “DEAD-Box RNA Helicase (DEAD5 or DEAD19) protein” or a “DEAD-Box RNA Helicase (DEAD5 or DEAD19) polypeptide,” as referred to herein, plays a role in brain development. For example, in zebrafish, the loss of a DEAD-Box RNA Helicase (DEAD5 or DEAD19) polypeptide in a cell results in mutants that display severe brain necrosis at 24 hours, and are dead by the second day of development. Accordingly, a “DEAD-Box RNA Helicase (DEAD5 or DEAD19) protein” or a “DEAD-Box RNA Helicase (DEAD5 or DEAD19) polypeptide” may be used as a marker for, or to prevent or treat, for example, a necrotizing brain disorder, such as Leigh's disease, or subacute necrotizing encephalomyelopathy disease.

[0283] By a “1581 protein,” or a “1581 polypeptide” is meant a polypeptide that has at least 48% amino acid sequence identity to the zebrafish 1581 amino acid sequence of SEQ ID NO:108 over a region that spans at least 273 contiguous amino acids. Desirably, a “1581 protein” or a “1581 polypeptide” is at least 55%, 60%, 70%, 75%, 80%, 85%, 90%, or 95% identical to the sequence of SEQ ID NO:108 over at least 150, 175, 200, 225, 250, or 273 contiguous amino acids. In a more desirable embodiment, a “1581 protein” or a “1581 polypeptide” is identical to the sequence of SEQ ID NO:108. Polypeptides encoded by splice variants of 1581 nucleic acid sequences, as well as by 1581 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 346 and 347 of the 1581 nucleic acid sequence of SEQ ID NO:107 are also included in this definition. A “1581 protein” or a “1581 polypeptide,” as referred to herein, plays a role in brain and eye development. For example, in zebrafish, the loss of a 1581 polypeptide in a cell results in mutants that display brain and eye necrosis and brains and eyes that are 50% smaller than those of wild-type zebrafish by the third day of development. Accordingly, a “1581 protein” or a “1581 polypeptide” may be used as a marker for, or to prevent or treat, for example, a necrotizing brain disorder, such as Leigh's disease, or subacute necrotizing encephalomyelopathy, or an eye malformation syndrome, for example, oculorenal syndrome, or Reiger syndrome, or a congenital brain defect that causes mental retardation, such as microencephaly, anencephaly, Down's syndrome, Chiari Malformation, Colpocephaly, holoprosencephaly, lissencephaly, or megalencephaly.

[0284] By a “Cyclin A2 protein,” or a “Cyclin A2 polypeptide” is meant a polypeptide that is, for example, identical to the sequence of SEQ ID NO:110. Polypeptides encoded by splice variants of Cyclin A2 nucleic acid sequences, as well as by Cyclin A2 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus at nucleotide 374 or 401 of the Cyclin A2 nucleic acid sequence of SEQ ID NO:109 are also included in this definition. A “Cyclin A2 protein” or a “Cyclin A2 polypeptide,” as referred to herein, plays a role head development. For example, in zebrafish, the loss of a Cyclin A2 polypeptide in a cell results in mutants that display heads that are 66% smaller than those of wild-type zebrafish by the fifth day of development display eye and CNS necrosis, no jaw, and abnormal semicircular canals. Accordingly, a “Cyclin A2 protein” or a “Cyclin A2 polypeptide” may be used as a marker for, or to prevent or treat, for example, a proliferative disease, sensory disorders, for example, Usher syndrome, Waardenburg syndrome, or other disorders characterized by congenital deafness or blindness, or a neurodegenerative disease such as Alzheimer's disease, Parkinson's disease, or spinocerebellar ataxia, or a congenital brain defect that causes mental retardation, such as microencephaly, anencephaly, Down's syndrome, Chiari Malformation, Colpocephaly, holoprosencephaly, lissencephaly, or megalencephaly, or a necrotizing brain disorder, such as Leigh's disease, or subacute necrotizing encephalomyelopathy disease.

[0285] By an “Imitation Switch (ISWI)/SNF2 protein,” an “ISWI/SNF2 protein,” or an “Imitation Switch (ISWI)/SNF2 polypeptide,” or an “ISWI/SNF2 polypeptide” is meant a polypeptide that has at least 70% amino acid sequence identity to the zebrafish ISWI/SNF2 amino acid sequence of SEQ ID NO:112 over at least 75, 100, 125, or 145 contiguous amino acids. Desirably, an “ISWI/SNF2 protein” or an “ISWI/SNF2 polypeptide” is at least 70%, 80%, 85%, 90%, or 95% identical to the sequence of SEQ ID NO:112. In a more desirable embodiment, an “ISWI/SNF2 protein,” or an “ISWI/SNF2 polypeptide” is identical to the sequence of SEQ ID NO:112. Polypeptides encoded by splice variants of ISWI/SNF2nucleic acid sequences, as well as by ISWI/SNF2 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus at nucleotide 76 of the ISWI/SNF2 nucleic acid sequence of SEQ ID NO:111 are also included in this definition. An “ISWI/SNF2 protein” or an “ISWI/SNF2 polypeptide,” as referred to herein, plays a role in eye, brain and jaw development. For example, in zebrafish, the loss of an ISWI/SNF2 polypeptide in a cell results in mutants that display eye necrosis, and necrosis of the inner cell ganglion layer and the optic tectum. The eyes of these mutants are 25% smaller those of wild-type zebrafish by the fourth day of development, and their lower jaw has dropped. Accordingly, an “ISWI/SNF2 protein” or an “ISWI/SNF2 polypeptide” may be used as a marker for, or to prevent or treat, for example, a necrotizing brain disorder, such as Leigh's disease, or subacute necrotizing encephalomyelopathy, or an eye malformation syndrome, such as syndrome, or Reiger syndrome, or a degenerative eye disease, such as retinitis pigmentosa or other retinal disorders, for example, macular degeneration, Friedreich's ataxia, or Laurence-Moon syndrome.

[0286] By a “Chromosomal Assembly Protein C (XCAP-C) protein,” an “XCAP-C protein,” or a “Chromosomal Assembly Protein C (XCAP-C) polypeptide,” or an “XCAP-C polypeptide” is meant a polypeptide that has at least 70% amino acid sequence identity to the zebrafish XCAP-C amino acid sequence of SEQ ID NO:114 over at least 600, 700, 800, 900, or 979 contiguous amino acids. Desirably, a “XCAP-C protein” or a “XCAP-C polypeptide” is at least 70%, 75%, 80%, 85%, 90%, or 95% identical to the sequence of SEQ ID NO:114. In a more desirable embodiment, a “XCAP-C protein,” or a “XCAP-C polypeptide” is identical to the sequence of SEQ ID NO:114. Polypeptides encoded by splice variants of XCAP-C nucleic acid sequences, as well as by XCAP-C nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus at nucleotide 181 and 182 of the XCAP-C nucleic acid sequence of SEQ ID NO: 113 are also included in this definition. A “XCAP-C protein” or a “XCAP-C polypeptide,” as referred to herein, plays a role in brain and eye development. For example, in zebrafish, the loss of a XCAP-C polypeptide in a cell results in mutants that display necrosis in the eye, optic tectum and hind-brain by day two of development. Accordingly, a “XCAP-C protein” or a “XCAP-C polypeptide” may be used as a marker for, or to prevent or treat, for example, a necrotizing brain disorder, such as Leigh's disease, or subacute necrotizing encephalomyelopathy, or a degenerative eye disease, such as retinitis piginentosa, macular degeneration, Friedreich's ataxia, or Laurence-Moon syndrome.

[0287] By a “DNA Replication Licensing Factor MCM2 protein,” or a “DNA Replication Licensing Factor MCM2 polypeptide” is meant a polypeptide that has at least 79% amino acid sequence identity to the zebrafish DNA Replication Licensing Factor MCM2 amino acid sequence of SEQ ID NO:116 over a region that spans at least 893 contiguous amino acids. Desirably, a “DNA Replication Licensing Factor MCM2 protein” or a “DNA Replication Licensing Factor MCM2 polypeptide” is at least 79%, 85%, 90%, or 95% identical to the sequence of SEQ ID NO:116 over at least 600, 700, 800, 850, or 893 contiguous amino acids. In a more desirable embodiment, a “DNA Replication Licensing Factor MCM2 protein,” or a “DNA Replication Licensing Factor MCM2 polypeptide” is identical to the sequence of SEQ ID NO:116. Polypeptides encoded by splice variants of DNA Replication Licensing Factor MCM2 nucleic acid sequences, as well as by DNA Replication Licensing Factor MCM2 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus at nucleotide 399 of the DNA Replication Licensing Factor MCM2 nucleic acid sequence of SEQ ID NO:115 are also included in this definition. A “DNA Replication Licensing Factor MCM2 protein” or a “DNA Replication Licensing Factor MCM2 polypeptide,” as referred to herein, plays a role in brain and eye development. For example, in zebrafish, the loss of a DNA Replication Licensing Factor MCM2 polypeptide in a cell results in mutants that display necrosis in the optic tectum. In addition the eyes of these mutants are smaller than those of wild-type zebrafish, and they have abnormal jaws and branchial arches by day five of development. Accordingly, a “DNA Replication Licensing Factor MCM2 protein” or a “DNA Replication Licensing Factor MCM2 polypeptide” may be used as a marker for, or to prevent or treat, a craniofacial defect, for example, Apert, Crouzon, Pfeiffer and Saethre-Chotzen Syndromes, or, for example, a necrotizing brain disorder, for example, Leigh's disease, or subacute necrotizing encephalomyelopathy, an eye malformation syndrome, such as oculorenal syndrome or Reiger syndrome, or a craniofacial defect, such as Apert, Crouzon, Pfeiffer or Saethre-Chotzen Syndromes.

[0288] By a “DNA Replication Licensing Factor MCM3 protein,” or a “DNA Replication Licensing Factor MCM3 polypeptide” is meant a polypeptide that has at least 86% amino acid sequence identity to the zebrafish DNA Replication Licensing Factor MCM3 amino acid sequence of SEQ ID NO:118 over a region that spans at least 178 contiguous amino acids. Desirably, a “DNA Replication Licensing Factor MCM3 protein” or a “DNA Replication Licensing Factor MCM3 polypeptide” is at least 86%, 90%, or 95% identical to the sequence of SEQ ID NO:118 over at least 100, 125, 150, or 178 contiguous amino acids. In a more desirable embodiment, a “DNA Replication Licensing Factor MCM3 protein,” or a “DNA Replication Licensing Factor MCM3 polypeptide” is identical to the sequence of SEQ ID NO:118. Polypeptides encoded by splice variants of DNA Replication Licensing Factor MCM3 nucleic acid sequences, as well as by DNA Replication Licensing Factor MCM3 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus at nucleotide 50, or between nucleotides 75 and 76 of the DNA Replication Licensing Factor MCM3 nucleic acid sequence of SEQ ID NO:117 are also included in this definition. A “DNA Replication Licensing Factor MCM3 protein” or a “DNA Replication Licensing Factor MCM3 polypeptide,” as referred to herein, plays a role in eye and brain development. For example, in zebrafish, the loss of a DNA Replication Licensing Factor MCM3 polypeptide in a cell results in mutants that display necrosis in the optic tectum. In addition the head and eyes of these mutants are at least 25% smaller than those of wild-type zebrafish by day three of development. Accordingly, a “DNA Replication Licensing Factor MCM3 protein” or a “DNA Replication Licensing Factor MCM3 polypeptide” may be used as a marker for, or to prevent or treat, a disorder associated with microcephaly, such as Angelman's syndrome or cri du chat, or an eye malformation syndrome, such as oculorenal syndrome or Reiger syndrome.

[0289] By a “Valyl-tRNA Synthase protein,” or a “Valyl-tRNA Synthase polypeptide” is meant a polypeptide that has at least 52% amino acid sequence identity to the zebrafish Valyl-tRNA Synthase amino acid sequence of SEQ ID NO:120 over at least 250, 300, 350, 400, or 440 contiguous amino acids. Desirably, a “Valyl-tRNA Synthase protein” or a “Valyl-tRNA Synthase polypeptide” is at least 52%, 60%, 70%, 80%, 90%, or 95% identical to the sequence of SEQ ID NO:120. In a more desirable embodiment, a “Valyl-tRNA Synthase protein,” or a “Valyl-tRNA Synthase polypeptide” is identical to the sequence of SEQ ID NO:120. Polypeptides encoded by splice variants of Valyl-tRNA Synthase nucleic acid sequences, as well as by Valyl-tRNA Synthase nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 30 and 31 of the Valyl-tRNA Synthase nucleic acid sequence of SEQ ID NO:119 are also included in this definition. A “Valyl-tRNA Synthase protein” or a “Valyl-tRNA Synthase polypeptide,” as referred to herein, plays a role in cell death or pigmentation. For example, in zebrafish, the loss of a Valyl-tRNA Synthase polypeptide in a cell results in mutants that display apoptosis in the brain. In addition, the head and eyes of these mutants are smaller than those of wild-type zebrafish by day three of development. The eyes of the mutant zebrafish are also lighter in color than those of wild-type zebrafish. Accordingly, a “Valyl-tRNA Synthase protein” or a “Valyl-tRNA Synthase polypeptide” may be used as a marker for, or to prevent or treat, for example, a disorder associated with microcephaly, such as Angelman's syndrome or cri du chat, or a neurodegenerative disease characterized by excess cell death, for example, Alzheimer's disease, Parkinson's disease, or spinocerebellar ataxia, or disorders related to pigmentation, such as human pigmentary glaucoma, Wagner's disease, Cockayne's syndrome, Kearns-Sayre disease, Waardenburg syndrome, Usher's syndrome, or Multiple lentigines syndrome.

[0290] By a “40S Ribosomal Protein S5 protein,” or a “40S Ribosomal Protein S5 polypeptide” is meant a polypeptide that has at least 96% amino acid sequence identity to the zebrafish 40S Ribosomal Protein S5 amino acid sequence of SEQ ID NO:122 over a region that spans at least 202 contiguous amino acids. Desirably, a “40S Ribosomal Protein S5 protein” or a “40S Ribosomal Protein S5 polypeptide” is at least 98% or 99% identical to the sequence of SEQ ID NO:122 over at least 100, 125, 150, 175, or 202 contiguous amino acids. In a more desirable embodiment, a “40S Ribosomal Protein S5 protein” or a “40S Ribosomal Protein S5 polypeptide” is idenical to SEQ ID NO:122. Polypeptides encoded by splice variants of 40S Ribosomal Protein S5 nucleic acid sequences, as well as by 40S Ribosomal Protein SS nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 31 and 32 of the 40S Ribosomal Protein SS nucleic acid sequence of SEQ ID NO:121 are also included in this definition. A “40S Ribosomal Protein S5 protein” or a “40S Ribosomal Protein S5 polypeptide,” as referred to herein, plays a role in brain development. For example, in zebrafish, the loss of a 40S Ribosomal Protein S5 polypeptide in a cell results in mutants that display a swollen “bubble-brain” phenotype at day two of development. In addition, the mutant zebrafish display persistent motility defects. Accordingly, a “40S Ribosomal Protein S5 protein” or a “40S Ribosomal Protein S5 polypeptide” may be used as a marker for, or to prevent or treat, for example, a congenital brain defect, such as hydranencephaly.

[0291] By a “TCP-1 Beta protein,” or a “TCP-1 Beta polypeptide” is meant a polypeptide that has at least 86% amino acid sequence identity to the zebrafish TCP-1 Beta amino acid sequence of SEQ ID NO:124 over a region that spans at least 507 contiguous amino acids. Desirably, a “TCP-1 Beta protein” or a “TCP-1 Beta polypeptide” is at least 90%, 95%, 98%, 99%, or even 100% identical to the sequence of SEQ ID NO:124 over at least 300, 350, 400, 450, or 507 contiguous amino acids. Polypeptides encoded by splice variants of TCP-1 Beta nucleic acid sequences, as well as by TCP-1 Beta nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 63 and 64 of the TCP-1 Beta nucleic acid sequence of SEQ ID NO: 123 are also included in this definition. A “TCP-1 Beta protein” or a “TCP-1 Beta polypeptide,” as referred to herein, plays a role in head development. For example, in zebrafish, the loss of a TCP-1 Beta polypeptide in a cell results in mutants that display jaw and cartilage defects, as well as a small head and eyes at day three of development as compared to wild-type zebrafish. Accordingly, a “TCP-1 Beta protein” or a “TCP-1 Beta polypeptide” may be used as a marker for, or to prevent or treat, for example, a disorder associated with microcephaly, such as Angelman's syndrome or cri du chat, or a disorder affecting the cartilage or connective tissue, such as arthritis, or a craniofacial defect, such as Apert, Crouzon, Pfeiffer and Saethre-Chotzen Syndromes, or an eye malformation syndrome, such as Reiger syndrome.

[0292] By a “TCP-1 Eta protein,” or a “TCP-1 Eta polypeptide” is meant a polypeptide that has at least 88% amino acid sequence identity to the zebrafish TCP-1 Eta amino acid sequence of SEQ ID NO:126 over a region that spans at least 541 contiguous amino acids. Desirably, a “TCP-1 Eta protein” or a “TCP-1 Eta polypeptide” is at least 88%, 90%, 95%, 98%, 99%, or even 100% identical to the sequence of SEQ ID NO:126 over at least 350, 400, 450, 500, or 541 contiguous amino acids. Polypeptides encoded by splice variants of TCP-1 Eta nucleic acid sequences, as well as by TCP-1 Eta nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 32 and 33 of the TCP-1 Eta nucleic acid sequence of SEQ ID NO:125 are also included in this definition. A “TCP-1 Eta protein” or a “TCP-1 Eta polypeptide,” as referred to herein, plays a role in head and eyedevelopment. For example, in zebrafish, the loss of a TCP-1 Eta polypeptide in a cell results in mutants that display, for example, smaller head and smaller eyes when compared to age-matched wild-type zebrafish. Accordingly, a “TCP-1 Eta protein” or a “TCP-1 Eta polypeptide” may be used as a marker for, or to prevent or treat, for example, a disorder associated with microcephaly, such as Angelman's syndrome or cri du chat, disease.

[0293] By a “Translation Elongation Factor eEF1 Alpha protein,” or a “Translation Elongation Factor eEF1 Alpha polypeptide” is meant a polypeptide that is, desirably, identical to the zebrafish Translation Elongation Factor eEF1 Alpha amino acid sequence of SEQ ID NO:128. Polypeptides encoded by splice variants of Translation Elongation Factor eEF1 Alpha nucleic acid sequences, as well as by Translation Elongation Factor eEF1 Alpha nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 60 and 61 of the Translation Elongation Factor eEF1 Alpha nucleic acid sequence of SEQ ID NO:127 are also included in this definition. A “Translation Elongation Factor eEF1 Alpha protein” or a “Translation Elongation Factor eEF1 Alpha polypeptide,” as referred to herein, plays a role in cell death. For example, in zebrafish, the loss of a Translation Elongation Factor eEF1 Alpha polypeptide in a cell results in mutants that display increased apoptosis in the head and eyes. These mutants display a head and eyes that are at least 33% smaller then those of age-matched wild-type zebrafish. Accordingly, a “Translation Elongation Factor eEF1 Alpha protein” or a “Translation Elongation Factor eEF1 Alpha polypeptide” may be used as a marker for, or to prevent or treat, for example, a neurodegenerative disease characterized by excess cell death, such as Alzheimer's disease, Parkinson's disease, or spinocerebellar ataxia, or a degenerative eye disease, such as retinitis pigmentosa, macular degeneration, Friedreich's ataxia, or Laurence-Moon syndrome, or a disorder associated with microcephaly, such as Angelman's syndrome or cri du chat.

[0294] By a “1257 protein,” or a “1257 polypeptide” is meant a polypeptide that has at least 49% amino acid sequence identity to the zebrafish 1257 amino acid sequence of SEQ ID NO:130 over a region spanning at least 372 contiguous amino acids. Desirably, a “1257 protein” or a “1257 polypeptide” is at least 49%, 60%, 70%, 80%, 90%, or 95% identical to the sequence of SEQ ID NO:130 over at least 250, 300, 350, or 372 contiguous amino acids. In a more desirable embodiment, a “1257 protein,” or a “1257 polypeptide” is identical to the sequence of SEQ ID NO:130. Polypeptides encoded by splice variants of 1257 nucleic acid sequences, as well as by 1257 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus at nucleotide 175 of the 1257 nucleic acid sequence of SEQ ID NO:129 are also included in this definition. A “1257 protein” or a “1257 polypeptide,” as referred to herein, plays a role in head and eye development. For example, in zebrafish, the loss of a 1257 polypeptide in a cell results in mutants having a head and eyes that are at least 25% smaller than those of age-matched four-day old wild-type zebrafish. In addition these zebrafish have an underdeveloped jaw. Accordingly, a “1257 protein” or a “1257 polypeptide” may be used as a marker for, or to prevent or treat, for example, a disorder associated with microcephaly, such as Angelman's syndrome or cri du chat, or an eye malformation syndrome, such as oculorenal syndrome, or Reiger syndrome, or a craniofacial defect, such as Apert, Crouzon, Pfeiffer or Saethre-Chotzen Syndromes.

[0295] By a “60S Ribosomal Protein L24 protein,” or a “60S Ribosomal Protein L24 polypeptide” is meant a polypeptide that has at least 89% amino acid sequence identity to the zebrafish 60S Ribosomal Protein L24 amino acid sequence of SEQ ID NO:132 over a region spanning at least 157 contiguous amino acids. Desirably, a “60S Ribosomal Protein L24 protein” or a “60S Ribosomal Protein L24 polypeptide” is at least 89%, 90%, or 95% identical to the sequence of SEQ ID NO:132 over at least 75, 100, 125, or 157 contiguous amino acids. In a more desirable embodiment, a “60S Ribosomal Protein L24 protein,” or a “60S Ribosomal Protein L24 polypeptide” is identical to the sequence of SEQ ID NO:132. Polypeptides encoded by splice variants of 60S Ribosomal Protein L24 nucleic acid sequences, as well as by 60S Ribosomal Protein L24 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus between nucleotides 144 and 145 of the 60S Ribosomal Protein L24 nucleic acid sequence of SEQ ID NO:131 are also included in this definition. A “60S Ribosomal Protein L24 protein” or a “60S Ribosomal Protein L24 polypeptide,” as referred to herein, plays a role in head development. For example, in zebrafish, the loss of a 60S Ribosomal Protein L24 polypeptide in a cell results in mutants that display having a head and eyes that are at least 33% smaller than those of age-matched wild-type zebrafish. Accordingly, a “60S Ribosomal Protein L24 protein” or a “60S Ribosomal Protein L24 polypeptide” may be used as a marker for, or to prevent or treat, for example, a disorder associated with microcephaly, such as Angelman's syndrome or cri du chat, or an eye malformation syndrome, such as oculorenal syndrome, or Reiger syndrome.

[0296] By a “Non-Muscle Adenylosuccinate Synthase protein,” or a “Non-Muscle Adenylosuccinate Synthase polypeptide” is meant a polypeptide that has at least 76% amino acid sequence identity to the zebrafish Non-Muscle Adenylosuccinate Synthase amino acid sequence of SEQ ID NO:134 over a region spanning at least 175 contiguous amino acids. Desirably, a “Non-Muscle Adenylosuccinate Synthase” or a “Non-Muscle Adenylosuccinate Synthase polypeptide” is at least 80%, 90%, or 95% identical to the sequence of SEQ ID NO:134 over at least 75, 100, 125, 150, or 175 contiguous amino acids. In a more desirable embodiment, a “Non-Muscle Adenylosuccinate Synthase protein,” or a “Non-Muscle Adenylosuccinate Synthase polypeptide” is identical to the sequence of SEQ ID NO:134. Polypeptides encoded by splice variants of Non-Muscle Adenylosuccinate Synthase nucleic acid sequences, as well as by Non-Muscle Adenylosuccinate Synthase nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus in the Non-Muscle Adenylosuccinate Synthase nucleic acid sequence of SEQ ID NO:133 between nucleotides 217 and 218, or at nucleotide 209, are also included in this definition. A “Non-Muscle Adenylosuccinate Synthase protein” or a “Non-Muscle Adenylosuccinate Synthase polypeptide,” as referred to herein, plays a role in head and eye development and cell death regulation. For example, in zebrafish, the loss of a Non-Muscle Adenylosuccinate Synthase polypeptide in a cell results in mutants that display a head and eyes that are at least 50% smaller than those of age-matched wild-type zebrafish. In addition, these mutants have some apoptotic. Cells and lack jaws and branchial arches. Accordingly, a “Non-Muscle Adenylosuccinate Synthase protein” or a “Non-Muscle Adenylosuccinate Synthase polypeptide” may be used as a marker for, or to prevent or treat, for example, a disorder associated with microcephaly, such as Angelman's syndrome or cri du chat, or a neurodegenerative disease characterized by excess cell death, such as Alzheimer's disease, Parkinson's disease, or spinocerebellar ataxia, or a cartilage or connective tissue disorder, such as arthritis or juvenile rheumatoid arthritis, or a craniofacial defect characterized by an underdeveloped jaw, such as Treacher-Collins syndrome or Marfan's syndrome.

[0297] By a “Nuclear Cap Binding Protein Subunit 2 protein,” or a “Nuclear Cap Binding Protein Subunit 2 polypeptide” is meant a polypeptide that has at least 85% amino acid sequence identity to the zebrafish Nuclear Cap Binding Protein Subunit 2 amino acid sequence of SEQ ID NO:136 over a region spanning at least 143 contiguous amino acids. Desirably, a “Nuclear Cap Binding Protein Subunit 2 Protein” or a “Nuclear Cap Binding Protein Subunit 2 polypeptide” is at least 90%, 95%, or 98% identical to the sequence of SEQ ID NO:136 over at least 75, 100, or 135 contiguous amino acids. In a more desirable embodiment, a “Nuclear Cap Binding Protein Subunit 2 protein,” or a “Nuclear Cap Binding Protein Subunit 2 polypeptide” is identical to the sequence of SEQ ID NO:136. Polypeptides encoded by splice variants of Nuclear Cap Binding Protein Subunit 2 nucleic acid sequences, as well as by Nuclear Cap Binding. Protein Subunit 2 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus in the Nuclear Cap Binding Protein Subunit 2 nucleic acid sequence of SEQ ID NO:135 between nucleotides 137 and 138, or at nucleotide 209, are also included in this definition. A “Nuclear Cap Binding Protein Subunit 2 protein” or a “Nuclear Cap Binding Protein Subunit 2 polypeptide,” as referred to herein, plays a role in head and eye development. For example, in zebrafish, the loss of a Nuclear Cap Binding Protein Subunit 2 polypeptide in a cell results in mutants that display a head and eyes that are at least 25% smaller than those of age-matched wild-type zebrafish by day four of development. In addition, these mutants have transient necrosis in the CNS between 24 and 48 hours of development, underdeveloped jaws, an underdeveloped stomach, and lack branchial arches three and four. Accordingly, a “Nuclear Cap Binding Protein Subunit 2 protein” or a “Nuclear Cap Binding Protein Subunit 2 polypeptide” may be used as a marker for, or to prevent or treat, for example, a disorder associated with microcephaly, such as Angelman's syndrome or cri du chat, or a craniofacial defect, such as Apert, Crouzon, Pfeiffer and Saethre-Chotzen Syndromes, or a necrotizing brain disorder, such as Leigh's disease, or subacute necrotizing encephalomyelopathy, or a degenerative eye disease, such as retinitis pigmentosa, macular degeneration, Friedreich's ataxia, or Laurence-Moon syndrome, or a cartilage or connective tissue disease, such as arthritis.

[0298] By an “Ornithine Decarboxylase protein,” or an “Ornithine Decarboxylase polypeptide” is meant a polypeptide encoded by a Ornithine Decarboxylase gene that, desirably, is identical to the zebrafish Ornithine Decarboxylase amino acid sequence of SEQ ID NO:138. Polypeptides encoded by splice variants of Ornithine Decarboxylase nucleic acid sequences, as well as by Ornithine Decarboxylase nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus in the Ornithine Decarboxylase nucleic acid sequence of SEQ ID NO:137 between nucleotides 97 and 98 are also included in this definition. An “Ornithine Decarboxylase protein” or an “Ornithine Decarboxylase polypeptide,” as referred to herein, plays a role in head development and necrosis. For example, in zebrafish, the loss of an Ornithine Decarboxylase polypeptide in a cell results in mutants that display display a head and eyes that are at least 33% smaller than those of age-matched wild-type zebrafish. In addition, these mutants have underdeveloped jaws and branchial arches relative to wild-type zebrafish. Accordingly, an “Ornithine Decarboxylase protein” or an “Ornithine Decarboxylase polypeptide” may be used as a marker for, or to prevent or treat, for example, a craniofacial defect, such as Apert, Crouzon, Pfeiffer or Saethre-Chotzen Syndromes, or a disorder associated with microcephaly, such as Angelman's syndrome or cri du chat, or an eye malformation syndrome, such as oculorenal syndrome, or Reiger syndrome, or a cartilage or connective tissue disorder, such as arthritis.

[0299] By a “Protein Phosphatase 1 Nuclear Targeting Subunit protein (PNUTS),” or a “Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS) polypeptide” is meant a polypeptide that has at least 55% amino acid sequence identity to the zebrafish Protein Phosphatase 1 Nuclear Targeting Subunit amino acid sequence of SEQ ID NO:140 over a region spanning at least 636 contiguous amino acids. Desirably, a “Protein Phosphatase 1 Nuclear Targeting Subunit protein” or a “Protein Phosphatase 1 Nuclear Targeting Subunit polypeptide” is at least 60%, 70%, 80%, 90%, 95%, or 98% identical to the sequence of SEQ ID NO:140. In a more desirable embodiment, a “Protein Phosphatase 1 Nuclear Targeting Subunit protein (PNUTS),” or a “Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS) polypeptide” is identical to the sequence of SEQ ID NO:140 over at least 400, 500, 550, 600, or 636 contiguous amino acids. Polypeptides encoded by splice variants of Protein Phosphatase 1 Nuclear Targeting Subunit nucleic acid sequences, as well as by Protein Phosphatase 1 Nuclear Targeting Subunit nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus in the Protein Phosphatase 1 Nuclear Targeting Subunit nucleic acid sequence of SEQ ID NO:139 at nucleotide 303 are also included in this definition. A “Protein Phosphatase 1 Nuclear Targeting Subunit protein” or a “Protein Phosphatase 1 Nuclear Targeting Subunit polypeptide,” as referred to herein, plays a role in head development. For example, in zebrafish, the loss of a Protein Phosphatase 1 Nuclear Targeting Subunit polypeptide in a cell results in mutants having slightly smaller head than those of age-matched wild-type zebrafish. In addition, these mutants have slightly compressed jaws and an underdeveloped gut relative to wild-type zebrafish. Accordingly, a “Protein Phosphatase 1 Nuclear Targeting Subunit protein” or a “Protein Phosphatase 1 Nuclear Targeting Subunit polypeptide” may be used as a marker for, or to prevent or treat, for example, craniofacial defects characterized by an underdeveloped jaw, such as Treacher-Collins syndrome or Marfan's syndrome.

[0300] By a “Mitochondrial Inner Membrane Translocating Protein (rTIM23),” or a “Mitochondrial Inner Membrane Translocating (rTIM23) polypeptide” is meant a polypeptide that has at least 79% amino acid sequence identity to the zebrafish Mitochondrial Inner Membrane Translocating Protein (rTIM23) amino acid sequence of SEQ ID NO:142 over at least 75, 100, 125, 150, 175, or 190 contiguous amino acids. Desirably, a “Mitochondrial Inner Membrane Translocating Protein” or a “Mitochondrial Inner Membrane Translocating Protein (rTIM23) polypeptide” is at least 79%, 80%, or 90% identical to the sequence of SEQ ID NO:142. In a more desirable embodiment, a “Mitochondrial Inner Membrane Translocating Protein (rTIM23),” or a “Mitochondrial Inner Membrane Translocating (rTIM23) polypeptide” is identical to the sequence of SEQ ID NO:142. Polypeptides encoded by splice variants of Mitochondrial Inner Membrane Translocating Protein (rTIM23) nucleic acid sequences, as well as by Mitochondrial Inner Membrane Translocating Protein (rTIM23) nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus in the Mitochondrial Inner Membrane Translocating Protein (rTIM23) nucleic acid sequence of SEQ ID NO:141 at nucleotide 100 are also included in this definition. A “Mitochondrial Inner Membrane Translocating Protein” or a “Mitochondrial Inner Membrane Translocating Protein (rTIM23) polypeptide,” as referred to herein, plays a role in development of the circulatory system. For example, in zebrafish, the loss of a Mitochondrial Inner Membrane Translocating polypeptide in a cell results in mutants that display display lighter eyes than wild-type zebrafish and defects in the tail blood vessel. In addition, most of these mutants are dying by day four of development. Accordingly, a “Mitochondrial Inner Membrane Translocating Protein” or a “Mitochondrial Inner Membrane Translocating polypeptide” may be used as a marker for, or to prevent or treat, for example, a pigmentation disorder, such as retinitis pigmentosa, human pigmentary glaucoma, Wagner's disease, Cockayne's syndrome, Kearns-Sayre disease, Waardenburg syndrome, Usher's syndrome, or Multiple lentigines syndrome, or a circulatory disorder, such as stroke.

[0301] By a “1447 protein,” or a “1447 polypeptide” is meant a polypeptide that has at least 59% amino acid sequence identity to the zebrafish 1447 amino acid sequence of SEQ ID NO:144 over a region spanning at least 738 contiguous amino acids. Desirably, a “1447 protein” or a “1447 polypeptide” is at least 70%, 80%, 90%, 95%, or 98% identical to the sequence of SEQ ID NO:144 over at least 500, 550, 600, 650, 700, or 738 contiguous amino acids. In a more desirable embodiment, a “1447 protein,” or a “1447 polypeptide” is identical to the sequence of SEQ ID NO:144. Polypeptides encoded by splice variants of 1447 nucleic acid sequences, as well as by 1447 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus in the 1447 nucleic acid sequence of SEQ ID NO:143 between nucleotides 227 and 228 are also included in this definition. A “1447 protein” or a “1447 polypeptide,” as referred to herein, plays a role in development of the head, eyes, and jaw. For example, in zebrafish, the loss of a 1447 polypeptide in a cell results in mutants that display head and eyes that are at least 25% smaller than those of three day-old wild-type zebrafish. In addition, these mutants lack, or have severely reduced, mandibular or branchial arches. Furthermore, these mutants have shorter fins, an underdeveloped stomach, lack a pancreas, and have bent tails. Accordingly, a “1447 protein” or a “1447 polypeptide” may be used as a marker for, or to prevent or treat, for example, a pancreatic disorder, such as diabetes, or a limb formation defect, such as achondroplasia, Ellis-van Creveld syndrome, Poland sequence, or Pfeiffer syndrome, or a cartilage or connective tissue disease, such as arthritis, or a disorder associated with microcephaly, such as Angelman's syndrome or cri du chat.

[0302] By an “ARS2 protein,” or an “ARS2 polypeptide” is meant a polypeptide that has at least 69% amino acid sequence identity to the zebrafish ARS2 amino acid sequence of SEQ ID NO:146 over a region spanning at least 917 contiguous amino acids. Desirably, an “ARS2 protein” or an “ARS2 polypeptide” is at least 79%, 80%, 90%, 95%, or 98% identical to the sequence of SEQ ID NO:146 over at least 700, 750, 800, 850, or 917 contiguous amino acids. In a more desirable embodiment, an “ARS2 protein,” or an “ARS2 polypeptide” is identical to the sequence of SEQ ID NO:146. Polypeptides encoded by splice variants of ARS2 nucleic acid sequences, as well as by ARS2 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus in the ARS2 nucleic acid sequence of SEQ ID NO:145 between nucleotides 103 and 104 are also included in this definition. An “ARS2 protein” or an “ARS2 polypeptide,” as referred to herein, plays a role jaw development and pigmentation. For example, in zebrafish, the loss of an ARS2 polypeptide in a cell results in mutants that display underdeveloped jaws and have necrosis in the tectum. In addition, these mutants have flecks of pigment in the otholiths and widespread edema by day five of development. Accordingly, an “ARS2 protein” or an “ARS2 polypeptide” may be used as a marker for, or to prevent or treat, such as a pigmentation disorder, such as retinitis pigmentosa, human pigmentary glaucoma, Wagner's disease, Cockayne's syndrome, Kearns-Sayre disease, Waardenburg syndrome, Usher's syndrome, or multiple lentigines syndrome, or craniofacial defects characterized by an underdeveloped jaw, such as Treacher-Collins syndrome or Marfan's syndrome.

[0303] By a “Sec61 Alpha protein,” or a “Sec61 Alpha polypeptide” is meant a polypeptide that, desirably, is identical to the zebrafish Sec61 Alpha amino acid sequence of SEQ ID NO:148 over a region spanning at least 190 contiguous amino acids. However, polypeptides encoded by splice variants of Sec61 Alpha nucleic acid sequences, as well as by Sec61 Alpha nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus in the Sec61 Alpha nucleic acid sequence of SEQ ID NO:147 between nucleotides 132 and 133 are also included in this definition. A “Sec61 Alpha protein” or a “Sec61 Alpha polypeptide,” as referred to herein, plays a role head, eye and body development. For example, in zebrafish, the loss of a Sec61 Alpha polypeptide in a cell results in mutants that display a bent body, head and eyes that are at least 25% smaller than those of age-matched wild-type zebrafish, and having a lack of development of the jaw or branchial arches. Accordingly, a “Sec61 Alpha protein” or a “Sec61 Alpha polypeptide” may be used as a marker for, or to prevent or treat, craniofacial defects characterized by an underdeveloped jaw, such as Treacher-Collins syndrome or Marfan's syndrome, or a disorder associated with microcephaly, such as Angelman's syndrome or cri du chat, or an eye malformation syndrome, such as oculorenal syndrome, or Reiger syndrome.

[0304] By a “BAF53a protein,” or a “BAF53a polypeptide” is meant a polypeptide that has at least 88% amino acid sequence identity to the zebrafish BAF53a amino acid sequence of SEQ ID NO:150 over a region spanning at least 429 contiguous amino acids. Desirably, a “BAF53a protein” or a “BAF53a polypeptide” is at least 90%, 95%, or 99% identical to the sequence of SEQ ID NO:150 over a region spanning at least 300, 350, 400, or 429 contiguous amino acids. In a more desirable embodiment, a “BAF53a protein,” or a “BAF53a polypeptide” is identical to the sequence of SEQ ID NO:150. Polypeptides encoded by splice variants of BAF53a nucleic acid sequences, as well as by BAF53A nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus in the BAF53A nucleic acid sequence of SEQ ID NO:149 at nucleotide 160 are also included in this definition. A “BAF53a protein” or a “BAF53A polypeptide,” as referred to herein, plays a role body, eye, and brain development. For example, in zebrafish, the loss of a BAF53a polypeptide in a cell results in fish that display a curved body, small underdeveloped eyes, and enlarged ventricles. Accordingly, a “BAF53a protein” or a “BAF53a polypeptide” may be used as a marker for, or to prevent or treat, for example, an eye malformation syndrome, such as oculorenal syndrome, or Reiger syndrome, or a congenital brain defect that causes mental retardation, such as microencephaly, anencephaly, Down's syndrome, Chiari Malformation, Colpocephaly, holoprosencephaly, lissencephaly, or megalencephaly.

[0305] By a “Histone Deacetylase protein,” or a “Histone Deacetylase polypeptide” is meant a polypeptide that has at least 90% amino acid sequence identity to the zebrafish Histone Deacetylase amino acid sequence of SEQ ID NO:152 over a region spanning at least 483 contiguous amino acids. Desirably, a “Histone Deacetylase protein” or a “Histone Deacetylase polypeptide” is at least 93%, 95%, or 98% identical to the sequence of SEQ ID NO:152 over at least 300, 350, 400, 450, or 483 contiguous amino acids. In a more desirable embodiment, a “Histone Deacetylase protein” or a “Histone Deacetylase polypeptide” is identical to the sequence of SEQ ID NO:152. Polypeptides encoded by splice variants of Histone Deacetylase nucleic acid sequences, as well as by Histone Deacetylase nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus in the Histone Deacetylase nucleic acid sequence of SEQ ID NO: 151 between nucleotides 98 and 99, or at nucleotide 88, are also included in this definition. A “Histone Deacetylase protein” or a “Histone Deacetylase polypeptide,” as referred to herein, plays a role in the development of the heart, eyes, semicircular canals, otoliths, and cartilaginous structures. For example, in zebrafish, the loss of a Histone Deacetylase polypeptide in a cell results in mutants that display enlarged hearts, with atria twice the size of wild-type, eyes that are 33% smaller than wild-type, and ears that lack semicircular canals and have close together or fused otoliths. In addition, fin buds, jaws and branchial arches all fail to develop. Moreover, at day five of development, Alcian blue fails to stain any cartilage corresponding to the pectoral fins, jaw, branchial arches, or the neurocranium. Accordingly, a “Histone Deacetylase protein” or a “Histone Deacetylase polypeptide” may be used as a marker for, or to prevent or treat, for example, hearing disorders, such as Usher syndrome, Waardenburg syndrome, or other disorders characterized by congenital deafness; or a cartilage, connective tissue, or bone disorder, such as arthritis, or a limb formation defect, such as achondroplasia, Ellis-van Creveld syndrome, Poland sequence, or Pfeiffer syndrome, or a congenital heart defect, such as an atrial or ventricular septal defect, or an eye malformation syndrome, such as oculorenal syndrome, or Reiger syndrome.

[0306] By a “Fibroblast Isoform of the ADP/ATP Carrier protein,” or a “Fibroblast Isoform of the ADP/ATP Carrier polypeptide” is meant a polypeptide that has at least 93% amino acid sequence identity to the zebrafish Fibroblast Isoform of the Fibroblast Isoform of the ADP/ATP Carrier Protein amino acid sequence of SEQ ID NO:154 over a region spanning at least 298 contiguous amino acids. Desirably, a “Fibroblast Isoform of the ADP/ATP Carrier Protein protein” or a “Fibroblast Isoform of the ADP/ATP Carrier Protein polypeptide” is at least 93% or 95% identical to to the sequence of SEQ ID NO:154 over a region spanning at least 175, 200, 225, 250, 275, or 298 contiguous amino acids. In more desirable embodiments, a “Fibroblast Isoform of the ADP/ATP Carrier protein,” or a “Fibroblast Isoform of the ADP/ATP Carrier polypeptide” is identical to the sequence of SEQ ID NO:154. Polypeptides encoded by splice variants of Fibroblast Isoform of the ADP/ATP Carrier Protein nucleic acid sequences, as well as by Fibroblast Isoform of the ADP/ATP Carrier Protein nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus in the Fibroblast Isoform of the ADP/ATP Carrier Protein nucleic acid sequence of SEQ ID NO:153 between nucleotides 178 and 179 are also included in this definition. A “Fibroblast Isoform of the ADP/ATP Carrier Protein protein” or a “Fibroblast Isoform of the ADP/ATP Carrier Protein polypeptide,” as referred to herein, plays a role in lung or, in zebrafish, swim bladder development. For example, in zebrafish, the loss of a Fibroblast Isoform of the ADP/ATP Carrier Protein polypeptide in a cell results in mutants that display swim bladders that fail to inflate. Accordingly, a “Fibroblast Isoform of the ADP/ATP Carrier Protein protein” or a “Fibroblast Isoform of the ADP/ATP Carrier Protein polypeptide” may be used as a marker for, or to prevent or treat, for example, a pulmonary disease, such as asthma.

[0307] By a “TAFII-55 protein,” or a “TAFII-55 polypeptide” is meant a polypeptide that has at least 68% amino acid sequence identity to the zebrafish TAFII-55 amino acid sequence of SEQ ID NO:156 over a region spanning at least 336 contiguous amino acids. Desirably, a “TAFII-55 protein” or a “TAFII-5 polypeptide” is at least 80%, 90%, 95%, or 98% identical to the sequence of SEQ ID NO:156 over a region spanning at least 336 contiguous amino acids. In a desirable embodiment, a “TAFII-55 protein,” or a “TAFII-55 polypeptide” is identical to the sequence of SEQ ID NO:156. Polypeptides encoded by splice variants of TAFII-55 nucleic acid sequences, as well as by TAFII-55 nucleic acids containing a mutation, for example, a mutation resulting from the insertion of a virus in the TAFII-55 nucleic acid sequence of SEQ ID NO:155 between nucleotides 107 and 108 are also included in this definition. A “TAFII-55 protein” or a “TAFII-55 polypeptide,” as referred to herein, plays a role in head, eye, lung, or, in zebrafish, swim bladder development. For example, in zebrafish, the loss of a TAFII-55 polypeptide in a cell results in mutants that display swim bladders that fail to fill. In addition the mutants have heads that are approximately 20% smaller, and eyes that are approximately 33% smaller than those of age-matched wild-type zebrafish. Accordingly, a “TAFII-55 protein” or a “TAFII-55 polypeptide” may be used as a marker for, or to prevent or treat, for example, a disorder associated with microcephaly, such as Angelman's syndrome or cri du chat, or a pulmonary disease, such as asthma, or an eye malformation syndrome, such as oculorenal syndrome, or Reiger syndrome.

[0308] By a “904 nucleic acid sequence” is meant a nucleic acid molecule that is at least 79%, 85%, 90%, 95%, or 98% identical, or to the sequence of SEQ ID NO:1. In a desirable embodiment, a “904 nucleic acid sequence” is identical to the sequence of SEQ ID NO:1. Such nucleic acid molecules include genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a 904 polypeptide (e.g., SEQ ID NO:2) or a portion thereof, as defined above. A mutation in a 904 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, at nucleotide 1315 of SEQ ID NO:1, using methods described herein. In addition, the invention includes mutations that result in aberrant 904 expression or function, including, as examples, null mutations and mutations causing truncations.

[0309] By a “POU2 nucleic acid sequence” is meant a nucleic acid sequence that is identical to the zebrafish POU2 nucleic acid sequence of SEQ ID NO:3. However, nucleic acid molecules consisting of splice variants of POU2 nucleic acid sequences, as well as POU2 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 653 or 1088 of SEQ ID NO:3, are also included in this definition. By a “POU2 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a POU2 polypeptide or a zebrafish (SEQ ID NO:4,) POU2 polypeptide, or a portion thereof, as defined above. A mutation in a POU2 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant POU2 expression or function, including, as examples, null mutations and mutations causing truncations.

[0310] By a “40S Ribosomal Protein S18 nucleic acid sequence” is meant a nucleic acid sequence that has at least 97%, 98%, or 99% identity to the zebrafish 40S Ribosomal Protein S18 nucleic acid sequence of SEQ ID NO:5 over at least 152, 200, 250, 300, 400, or 500 nucleotides. In a desirable embodiment, a “40S Ribosomal Protein S18 nucleic acid sequence” is identical to the sequence of SEQ ID NO:5. However, nucleic acid molecules consisting of splice variants of 40S Ribosomal Protein S18 nucleic acid sequences, as well as 40S Ribosomal Protein S18 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 220 and 221 of SEQ ID NO:5, are also included in this definition. By a “40S Ribosomal Protein S18 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a 40S Ribosomal Protein S18 polypeptide or a zebrafish (e.g., SEQ ID NO:6) 40S Ribosomal Protein S18 polypeptide, or a portion thereof, as defined above. A mutation in a 40S Ribosomal Protein S18 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 40S Ribosomal Protein S18 expression or function, including, as examples, null mutations and mutations causing truncations.

[0311] By a “U2AF nucleic acid sequence” is meant a nucleic acid sequence that has at least 82%, 85%, 90%, 95%, or 97% identity to the zebrafish U2AF nucleic acid sequence of SEQ ID NO:7 over at least 100, 200, 300, 400, 537, 750, 1000, or 1500 nucleotides. In a desirable embodiment, a “U2AF nucleic acid sequence” is identical to the sequence of SEQ ID NO:7. Nucleic acid molecules consisting of splice variants of U2AF nucleic acid sequences, as well as U2AF nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 46 and 47 of SEQ ID NO:7, are also included in this definition. By a “U2AF nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a U2AF polypeptide or a zebrafish U2AF polypeptide (e.g., SEQ ID NO:8), or a portion thereof, as defined above. A mutation in a U2AF nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant U2AF expression or function, including, as examples, null mutations and mutations causing truncations.

[0312] By a “954 nucleic acid sequence” is meant a nucleic acid sequence that has at least 40%, 45%, 50%, 55%, 60%, 70%, 80%, 90%, 95%, or 98% identity to the zebrafish 954 nucleic acid sequence of SEQ ID NO:9 over at least 100, 200, 250, 300, 400, 500, 600, 750, 1000, 1250, 1500, 1750, 2000, or 2100 contiguous nucleotides. In a desirable embodiment, a “954 nucleic acid sequence” is identical to the sequence of SEQ ID NO:9. Nucleic acid molecules consisting of splice variants of 954 nucleic acid sequences, as well as 954 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 432 or 506 of SEQ ID NO:9, are also included in this definition. By a “954 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a 954 polypeptide or a zebrafish 954 polypeptide (e.g., SEQ ID NO:10), or a portion thereof, as defined above. A mutation in a 954 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 954 expression or function, including, only as examples, null mutations and mutations causing truncations.

[0313] By a “Neurogenin Related Protein-1 nucleic acid sequence” or a “Nrp-1 nucleic acid sequence” is meant a nucleic acid sequence that identical to a zebrafish Nrp-1 nucleic acid sequence, for example, that of SEQ ID NO:11. Nucleic acid molecules consisting of splice variants of Nrp-1 nucleic acid sequences, as well as Nrp-1 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 1149 of SEQ ID NO:11, are also included in this definition. By a “Nrp-1 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a Nrp-1 polypeptide or a zebrafish Nrp-1 polypeptide (e.g., SEQ ID NO:12), or a portion thereof, as defined above. A mutation in a Nrp-1 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Nrp-1 expression or function, including, only as examples, null mutations and mutations causing truncations.

[0314] By a “Caudal nucleic acid sequence” or “Cad-1 nucleic acid sequence” is meant a nucleic acid sequence that is identical to a zebrafish Caudal or Cad-1 nucleic acid sequence, for example, that of SEQ ID NO:13. Nucleic acid molecules consisting of splice variants of Cad-1 nucleic acid sequences, as well as Cad-1 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 583 and 584 of SEQ ID NO:13, are also included in this definition. By a “Caudal nucleic acid sequence” or “Cad-1 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a Caudal or Cad-1 polypeptide or a zebrafish Cad-1 polypeptide (e.g., SEQ ID NO:14), or a portion thereof, as defined above. A mutation in a cad-1 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Caudal or Cad-1 expression or function, including, only as examples, null mutations and mutations causing truncations.

[0315] By a “V-ATPase Alpha Subunit nucleic acid sequence” is meant a nucleic acid sequence that has at least 85%, 90%, 95%, or 98% identity to the zebrafish V-ATPase Alpha Subunit nucleic acid sequence of SEQ ID NO:15 over at least 800, 1000, or 1200 nucleotides. In a desirable embodiment, a “V-ATPase Alpha Subunit nucleic acid sequence” is identical to the sequence of SEQ ID NO:15. Nucleic acid molecules consisting of splice variants of V-ATPase Alpha Subunit nucleic acid sequences, as well as V-ATPase Alpha Subunit nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 169 of SEQ ID NO:15, are also included in this definition. By a “V-ATPase Alpha Subunit nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a V-ATPase Alpha Subunit polypeptide or a zebrafish V-ATPase Alpha Subunit polypeptide (e.g., SEQ ID NO:16), or a portion thereof, as defined above. A mutation in a V-ATPase Alpha Subunit nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant V-ATPase Alpha Subunit expression or function, including, only as examples, null mutations and mutations causing truncations.

[0316] By a “V-ATPase SFD Subunit nucleic acid sequence” is meant a nucleic acid sequence that has at least 81%, 85%, 90%, 95%, or 98% identity to the zebrafish V-ATPase SFD Subunit nucleic acid sequence of SEQ ID NO:17 over at least 100, 200, 300, 400, 537, 750, 1000, or 1500 nucleotides. In a desirable embodiment, a “V-ATPase SFD Subunit nucleic acid sequence” is identical to the sequence of SEQ ID NO:17. Nucleic acid molecules consisting of splice variants of V-ATPase SFD Subunit nucleic acid sequences, as well as V-ATPase SFD Subunit nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 31 and 32 of SEQ ID NO:17, are also included in this definition. By a “V-ATPase SFD Subunit nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a V-ATPase SFD subunit polypeptide or a zebrafish V-ATPase SFD Subunit polypeptide (e.g., SEQ ID NO:18), or a portion thereof, as defined above. A mutation in a V-ATPase SFD Subunit nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant V-ATPase SFD Subunit expression or function, including, as examples, null mutations and mutations causing truncations.

[0317] By a “1463 nucleic acid sequence” is meant a nucleic acid sequence that has at least 74%, 80%, 85%, 90%, 95%, or 98% identity to the zebrafish 1463 nucleic acid sequence of SEQ ID NO:157 over at least 175, 200, 300, 400, 500, 750, 1000, 1500, 2000, 2500, or 3000 nucleotides. In a desirable embodiment, a “1463” nucleic acid sequence” is identical to the sequence of SEQ ID NO:157. Nucleic acid molecules consisting of splice variants of 1463 nucleic acid sequences, as well as 1463 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 389 and 390 of SEQ ID NO:157, are also included in this definition. By a “1463 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a 1463 polypeptide or a zebrafish 1463 polypeptide (e.g., SEQ ID NO:158), or a portion thereof, as defined above. A mutation in a 1463 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 1463 expression or function, including, as examples, null mutations and mutations causing truncations.

[0318] By a “VPSP18 nucleic acid sequence” is meant a nucleic acid sequence that has at least 83%, 90%, 95%, or 98% identity to the zebrafish VPSP18 nucleic acid sequence of SEQ ID NO:21 spanning at least 50, 70, 100, 200, 300, 500, 750, 1000, 1200, or 1400 contiguous nucleic acids. In a desirable embodiment, “VPSP18 nucleic acid sequence” is identical to the sequence of SEQ ID NO:21. Nucleic acid molecules consisting of splice variants of VPSP18 nucleic acid sequences, as well as VPSP18 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 2336 of SEQ ID NO:21, are also included in this definition. A mutation in a VPSP18 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant VPSP18 expression or function, including, as examples, null mutations and mutations causing truncations.

[0319] By a “Pescadillo nucleic acid sequence” is meant a nucleic acid sequence that is identical to a zebrafish Pescadillo nucleic acid sequence, for example, that of GenBank Accession Number U77627. Nucleic acid molecules encoded by splice variants of Pescadillo nucleic acid sequences, as well as Pescadillo nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at approximately nucleotide 20 of the U77627 sequence, are also included in this definition. By a “Pescadillo nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a Pescadillo polypeptide or a zebrafish Pescadillo polypeptide, or a portion thereof, as defined above. A mutation in a Pescadillo nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Pescadillo expression or function, including, as examples, null mutations and mutations causing truncations.

[0320] By a “HNF-β/vHNF1 nucleic acid sequence” is meant a nucleic acid sequence that has at least 81%, 85%, 90%, 95%, or 98% identity to the zebrafish HNF-β/vHNF1 nucleic acid sequence of SEQ ID NO:23 over at least 100, 200, 270, 300, 400, 500, 600, 700, 800, 1000, 1500, 2000, 2500, or 3000 nucleotides. In a desirable embodiment, a “HNF-β/vHNF1 nucleic acid sequence” is identical to the sequence of SEQ ID NO:23. Nucleic acid molecules consisting of splice variants of HNF-β/vHNF1nucleic acid sequences, as well as HNF-β/vHNF1 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 1682/1683, or at nucleotide 361 or 745 of SEQ ID NO:23, are also included in this definition. By a “HNF-β/vHNF1 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a HNF-β/vHNF1 polypeptide or a zebrafish HNF-β/vHNF1 polypeptide (e.g., SEQ ID NO:24), or a portion thereof, as defined above. A mutation in a HNF-β/vHNF1 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant HNF-β/vHNF1 expression or function, including, as examples, null mutations and mutations causing truncations.

[0321] By a “60S Ribosomal Protein L35 nucleic acid sequence” is meant a nucleic acid sequence that has at least 82%, 85%, 90%, 95%, or 98% identity to the zebrafish 60S Ribosomal Protein L35 nucleic acid sequence of SEQ ID NO:25 over at least 100, 200, 319, 400, ot 450 nucleotides. In a desirable embodiment, a “60S Ribosomal Protein L35 nucleic acid sequence” is identical to the sequence of SEQ ID NO:25. Nucleic acid molecules consisting of splice variants of 60S Ribosomal Protein L35 nucleic acid sequences, as well as 60S Ribosomal Protein L35 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 30 and 31 of SEQ ID NO:25, are also included in this definition. By a “60S Ribosomal Protein L35 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a 60S Ribosomal Protein L35 polypeptide or a zebrafish 60S Ribosomal Protein L35 polypeptide (e.g., SEQ ID NO:26), or a portion thereof, as defined above. A mutation in a 60S Ribosomal Protein L35 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 60S Ribosomal Protein L35 expression or function, including, as examples, null mutations and mutations causing truncations.

[0322] By a “60S Ribosomal Protein L44 nucleic acid sequence” is meant a nucleic acid sequence that has at least 85%, 90%, 95%, or 98% identity to the zebrafish 60S Ribosomal Protein L44 nucleic acid sequence of SEQ ID NO:27 over at least 100, 200, 324, or 350 nucleotides. In a desirable embodiment, a “60S Ribosomal Protein L44 nucleic acid sequence” is identical to the sequence of SEQ ID NO:27. Nucleic acid molecules consisting of splice variants of 60S Ribosomal Protein L44 nucleic acid sequences, as well as 60S Ribosomal Protein L44 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 195 and 196 of the nucleic acid sequence of SEQ ID NO:27, are also included in this definition. By a “60S Ribosomal Protein L44 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a 60S Ribosomal Protein L44 polypeptide or a zebrafish 60S Ribosomal Protein L44 polypeptide e.g., SEQ ID NO:28, or a portion thereof, as defined above. A mutation in a 60S Ribosomal Protein L44 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 60S Ribosomal Protein L44 expression or function, including, as examples, null mutations and mutations causing truncations.

[0323] By a “CopZ1 nucleic acid sequence” is meant a nucleic acid sequence that is, for example, identical to the zebrafish CopZ1 nucleic acid sequence of SEQ ID NO:29. In a desirable embodiment, a “CopZ1 nucleic acid sequence” is identical to the sequence of SEQ ID NO:29. Nucleic acid molecules consisting of splice variants of CopZ1 nucleic acid sequences, as well as CopZ1 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 90 and 91 of SEQ ID NO:29, are also included in this definition. By a “CopZ1 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a CopZ1 polypeptide or a zebrafish CopZ1 polypeptide (e.g., SEQ ID NO:30), or a portion thereof, as defined above. A mutation in a CopZ1 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant CopZ1expression or function, including, as examples, null mutations and mutations causing truncations.

[0324] By a “215 nucleic acid sequence” is meant a nucleic acid sequence that has at least 81%, 85%, 90%, 95% or 98% identity to the zebrafish 215 nucleic acid sequence of SEQ ID NO:31 over at least 100, 188, 300, 400, 500, 600, 700, 800, 1000, 1500, or 2000 contiguous nucleotides. In a desirable embodiment, a “215 nucleic acid sequence” is identical to the sequence of SEQ ID NO:31. Nucleic acid molecules consisting of splice variants of 215 nucleic acid sequences, as well as 215 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 294 and 295 of SEQ ID NO:31, are also included in this definition. By a “215 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a 215 polypeptide or a zebrafish 215 polypeptide (e.g., SEQ ID NO:32), or a portion thereof, as defined above. A mutation in a 215 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 215 expression or function, including, as examples, null mutations and mutations causing truncations.

[0325] By a “307 nucleic acid sequence” is meant a nucleic acid sequence that has at least 94%, 96%, or 98% identity to the zebrafish 307 nucleic acid sequence of SEQ ID NO:33 over at least 34, 50, 100, 200, 300, 400, 500, 600, 700, 800, or 900 nucleotides. In a desirable embodiment, a “307 nucleic acid sequence” is identical to the sequence of SEQ ID NO:33. Nucleic acid molecules consisting of splice variants of 307 nucleic acid sequences, as well as 307 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 176 of SEQ ID NO:33, are also included in this definition. By a “307 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a 307 polypeptide or a zebrafish 307 polypeptide (e.g., SEQ ID NO:34), or a portion thereof, as defined above. A mutation in a 307 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 307 expression or function, including, as examples, null mutations and mutations causing truncations.

[0326] By a “572 nucleic acid sequence” is meant a nucleic acid sequence that has at least 37%, 40%, 50%, 60%, 70%, 80%, 85%, 90%, 95%, 97%, or 99% identity to the zebrafish 572 nucleic acid sequence of SEQ ID NO:35 over at least 50, 100, 150, 200, 250, 300, 350, 400, 500, or 750 contiguous nucleotides. In a desirable embodiment, a “572 nucleic acid sequence” is identical to the sequence of SEQ ID NO:35. Nucleic acid molecules consisting of splice variants of 572 nucleic acid sequences, as well as 572 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 277 of SEQ ID NO:35, are also included in this definition. By a “572 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a 572 polypeptide or a zebrafish 572 polypeptide (e.g., SEQ ID NO:36), or a portion thereof, as defined above. A mutation in a 572 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 572 expression or function, including, as examples, null mutations and mutations causing truncations.

[0327] By a “1116A nucleic acid sequence” is meant a nucleic acid sequence that has at least 42%, 45%, 50%, 60%, 70%, 80%, 85%, 90%, or 95% identity to the zebrafish 1116A nucleic acid sequence of SEQ ID NO:37 over at least 1000 nucleotides. In a desirable embodiment, a “1116A nucleic acid sequence” is identical to the sequence of SEQ ID NO:37. Nucleic acid molecules consisting of splice variants of 1116A nucleic acid sequences, as well as 1116A nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 135 of SEQ ID NO:37, are also included in this definition. By a “1116A nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a 1116A polypeptide or a zebrafish 1116A polypeptide (e.g., SEQ ID NO:38), or a portion thereof, as defined above. A mutation in a 1116A nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 1116A expression or function, including, as examples, null mutations and mutations causing truncations.

[0328] By a “1548 nucleic acid sequence” is meant a nucleic acid sequence that has at least 78%, 85%, 90%, 95%, or 98% identity to the zebrafish 1548 nucleic acid sequence of SEQ ID NO:39 over at least 100, 200, 300, 400, 503, 600, 700, 800, 900, 1000, 1500, 2000, 2500, or 2900 contiguous nucleotides. In a desirable embodiment, a “1548 nucleic acid sequence” is identical to the sequence of SEQ ID NO:39. Nucleic acid molecules consisting of splice variants of 1548 nucleic acid sequences, as well as 1548 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 85 of SEQ ID NO:39, are also included in this definition. By a “1548 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a 1548 polypeptide or a zebrafish 1548 polypeptide (e.g., SEQ ID NO:39), or a portion thereof, as defined above. A mutation in a 1548 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 1548 expression or function, including, as examples, null mutations and mutations causing truncations.

[0329] By a “Casein Kinase 1 α nucleic acid sequence” is meant a nucleic acid sequence that has at least 84%, 90%, 95%, or 98% identity to the zebrafish Casein Kinase 1α nucleic acid sequence of SEQ ID NO:41 over at least 250, 500, 750, 976, 1000, 1250, 1500, or 2000 nucleotides. In a desirable embodiment, a “Casein Kinase 1α nucleic acid sequence” is identical to the sequence of SEQ ID NO:41. Nucleic acid molecules consisting of splice variants of Casein Kinase 1α nucleic acid sequences, as well as Casein Kinase 1α nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 730 and 731 of SEQ ID NO:41, are also included in this definition. By a “Casein Kinase 1α nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a Casein Kinase 1α polypeptide or a zebrafish Casein Kinase 1α polypeptide (e.g., SEQ ID NO:41), or a portion thereof, as defined above. A mutation in a Casein Kinase 1α nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Casein Kinase 1α expression or function, including, as examples, null mutations and mutations causing truncations.

[0330] By a “Nodal-Related (squint) nucleic acid sequence” is meant a nucleic acid sequence that is identical to a zebrafish Nodal-Related (squint) nucleic acid sequence, for example, that of SEQ ID NO:43. Nucleic acid molecules consisting of splice variants of Nodal-Related (squint) nucleic acid sequences, as well as Nodal-Related (squint) nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 654 of SEQ ID NO:43, are also included in this definition. By a “Nodal-Related (squint) nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a Nodal-Related (Squint) polypeptide or a zebrafish Nodal-Related (Squint) polypeptide (e.g., SEQ ID NO:43), or a portion thereof, as defined above. A mutation in a Nodal-Related (squint) nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Nodal-Related (squint) expression or function, including, as examples, null mutations and mutations causing truncations.

[0331] By a “Smoothened nucleic acid sequence” is meant a nucleic acid sequence that is, for example, identical to the zebrafish Smoothened nucleic acid sequence of SEQ ID NO:45. Nucleic acid molecules consisting of splice variants of Smoothened nucleic acid sequences, as well as Smoothened nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 271 or 600 of SEQ ID NO:45, are also included in this definition. By a “Smoothened nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a Smoothened polypeptide or a zebrafish Smoothened polypeptide (e.g., SEQ ID NO:46), or a portion thereof, as defined above. A mutation in a Smoothened nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Smoothened expression or function, including, as examples, null mutations and mutations causing truncations.

[0332] By a “429 nucleic acid sequence” is meant a nucleic acid sequence that has at least 40%, 50%, 55%, 60%, 70%, 80%, 85%, 90%, 95%, 97%, 99% identity to the zebrafish 429 nucleic acid sequence of SEQ ID NO:47 over at least 50, 100, 200, 500, 1000, 1500, 2000, or 2400 contiguous nucleotides. In a desirable embodiment, a “429 nucleic acid sequence” is identical to the sequence of SEQ ID NO:47. Nucleic acid molecules consisting of splice variants of 429 nucleic acid sequences, as well as 429 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 182 and 183 of SEQ ID NO:47, are also included in this definition. By a “429 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a 429 polypeptide or a zebrafish 429 polypeptide (e.g., SEQ ID NO:48), or a portion thereof, as defined above. A mutation in a 429 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 429 expression or function, including, as examples, null mutations and mutations causing truncations.

[0333] By a “428 nucleic acid sequence” is meant a nucleic acid sequence that has at least 85%, 90%, 95%, or 98% identity to the zebrafish 428 nucleic acid sequence of SEQ ID NO:49 over at least 50, 100, 200, 300, 400, 500, 600, or 700 nucleotides. In a desirable embodiment, a “428 nucleic acid sequence” is identical to the sequence of SEQ ID NO:49. Nucleic acid molecules consisting of splice variants of 428 nucleic acid sequences, as well as 428 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 187 of SEQ ID NO:49, are also included in this definition. By a “428 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a 428 polypeptide or a zebrafish 428 polypeptide (e.g., SEQ ID NO:50), or a portion thereof, as defined above. A mutation in a 428 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 428 expression or function, including, as examples, null mutations and mutations causing truncations.

[0334] By a “Spinster nucleic acid sequence” is meant a nucleic acid sequence that has at least 83%, 88%, 90%, 95%, or 98% identity to a zebrafish Spinster nucleic acid sequence, for example, that of SEQ ID NO:51. In a desirable embodiment, a “Spinster nucleic acid sequence” is identical to the sequence of SEQ ID NO:51. Nucleic acid molecules consisting of splice variants of Spinster nucleic acid sequences, as well as Spinster nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion several kb, for example, 2, 3, 4, 5, 8, 10, or 15 kb upstream of the coding region, are also included in this definition. By a “Spinster nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a Spinster polypeptide or a zebrafish Spinster polypeptide (e.g., SEQ ID NO:52), or a portion thereof, as defined above. A mutation in a Spinster nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Spinster expression or function, including, as examples, null mutations and mutations causing truncations.

[0335] By a “Glypican-6 or Knypek nucleic acid sequence” is meant a nucleic acid sequence that has at least 87%, 90%, 95%, or 98% identity to the zebrafish Glypican-6 or Knypek nucleic acid sequence of SEQ ID NO:53 over at least 100, 200, 500, 750, 100, 1500, 1750, or 2000 contiguous nucleic acids. In a desirable embodiment, a “Glypican-6 or Knypek nucleic acid sequence” is identical to the sequence of SEQ ID NO:53. Nucleic acid molecules encoded by splice variants of Glypican-6 or Knypek nucleic acid sequences, as well as Glypican-6 or Knypek nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 133 or 1054 of SEQ ID NO:53, are also included in this definition. By a “Glypican-6 or Knypek nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a Glypican-6 or Knypek polypeptide or a zebrafish Glypican-6 or Knypek polypeptide (e.g., SEQ ID NO:54), or a portion thereof, as defined above. A mutation in a Glypican-6 or knypek nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Glypican-6 or Knypek expression or function, including, as examples, null mutations and mutations causing truncations.

[0336] By a “Ribonucleotide Reductase Protein R1 Class 1 nucleic acid sequence” is meant a nucleic acid sequence that is identical to a zebrafish Ribonucleotide Reductase Protein R1 Class 1 nucleic acid sequence, for example, that of SEQ ID NO:55. Nucleic acid molecules consisting of splice variants of Ribonucleotide Reductase Protein R1 Class 1 nucleic acid sequences, as well as Ribonucleotide Reductase Protein R1 Class 1 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 147 and 148 of SEQ ID NO:55, are also included in this definition. By a “Ribonucleotide Reductase Protein R1 Class 1 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a Ribonucleotide Reductase Protein R1 Class 1 polypeptide or a zebrafish Ribonucleotide Reductase Protein R1 Class 1 polypeptide (e.g., SEQ ID NO:56), or a portion thereof, as defined above. A mutation in a Ribonucleotide Reductase Protein R1 Class 1 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Ribonucleotide Reductase Protein R1 Class 1 expression or function, including, as examples, null mutations and mutations causing truncations.

[0337] By a “Kinesin Related Motor Protein EG5 nucleic acid sequence” is meant a nucleic acid sequence that has at least 80%, 85%, 90%, 95%, or 98% identity to the zebrafish Kinesin Related Motor Protein EG5 nucleic acid sequence of SEQ ID NO:57 over at least 250, 538, 600, 700, 800, 900, 1000, 1500, 2000, 2500, 3000, or 3500 nucleotides. In a desirable embodiment, a “Kinesin Related Motor Protein EG5 nucleic acid sequence” is identical to the sequence of SEQ ID NO:57. Nucleic acid molecules consisting of splice variants of Kinesin Related Motor Protein EG5 nucleic acid sequences, as well as Kinesin Related Motor Protein EG5 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 50 and 51 of SEQ ID NO:57, are also included in this definition. By a “Kinesin Related Motor Protein EG5 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a Kinesin Related Motor Protein EG5 polypeptide or a zebrafish Kinesin Related Motor Protein EG5 polypeptide (e.g., SEQ ID NO:58), or a portion thereof, as defined above. A mutation in a Kinesin Related Motor Protein EG5 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Kinesin Related Motor Protein EG5 expression or function, including, as examples, null mutations and mutations causing truncations.

[0338] By a “Wnt5 (pipetail) nucleic acid sequence” is meant a nucleic acid sequence that is identical to a zebrafish Wnt5 (pipetail) nucleic acid sequence, for example, that of SEQ ID NO:61. Nucleic acid molecules consisting of splice variants of Wnt5 (pipetail) nucleic acid sequences, as well as Wnt5 (pipetail) nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 530 and 531 or at nucleotide 397 of SEQ ID NO:61, are also included in this definition. By a “Wnt5 (pipetail) nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a Wnt5 or Pipetail polypeptide or a zebrafish Wnt5 or Pipetail polypeptide (e.g., SEQ ID NO:62), or a portion thereof, as defined above. A mutation in a Wnt5 (pipetail) nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Wnt5 or Pipetail expression or function, including, as examples, null mutations and mutations causing truncations.

[0339] By an “Aryl Hydrocarbon Receptor Nuclear Translocator 2A nucleic acid sequence” is meant a nucleic acid sequence that is identical to a zebrafish Aryl Hydrocarbon Receptor Nuclear Translocator 2A nucleic acid sequence, for example, that of SEQ ID NO:63. Nucleic acid molecules consisting of splice variants of Aryl Hydrocarbon Receptor Nuclear Translocator 2A nucleic acid sequences, as well as Aryl Hydrocarbon Receptor Nuclear Translocator 2A nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 229 or 240 of SEQ ID NO:63, are also included in this definition. By a “Aryl Hydrocarbon Receptor Nuclear Translocator 2A nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a Aryl Hydrocarbon Receptor Nuclear Translocator 2A polypeptide or a zebrafish Aryl Hydrocarbon Receptor Nuclear Translocator 2A polypeptide (e.g., SEQ ID NO:64), or a portion thereof, as defined above. A mutation in a Aryl Hydrocarbon Receptor Nuclear Translocator 2A nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Aryl Hydrocarbon Receptor Nuclear Translocator 2A expression or function, including, as examples, null mutations and mutations causing truncations.

[0340] By a “Vesicular Integral-Membrane Protein VIP 36 nucleic acid sequence” is meant a nucleic acid sequence that has at least 76%, 80%, 85%, 90%, 95% or 98% identity to the zebrafish Vesicular Integral-Membrane Protein VIP 36 nucleic acid sequence of SEQ ID NO:65 over at least 100, 200, 271, 300, 400, 500, 600, 700, 800, 900, or 1000 nucleotides. In a desirable embodiment, a “Vesicular Integral-Membrane Protein VIP 36 nucleic acid sequence” is identical to the sequence of SEQ ID NO:65. Nucleic acid molecules consisting of splice variants of Vesicular Integral-Membrane Protein VIP 36 nucleic acid sequences, as well as Vesicular Integral-Membrane Protein VIP 36 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 219 and 220 of SEQ ID NO:65, are also included in this definition. By a “Vesicular Integral-Membrane Protein VIP 36 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a Vesicular Integral-Membrane Protein VIP 36 polypeptide or a zebrafish Vesicular Integral-Membrane Protein VIP 36 polypeptide (e.g., SEQ ID NO:66), or a portion thereof, as defined above. A mutation in a Vesicular Integral-Membrane Protein VIP 36 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Vesicular Integral-Membrane Protein VIP 36 expression or function, including, as examples, null mutations and mutations causing truncations.

[0341] By an “299 nucleic acid sequence” is meant a nucleic acid sequence that has at least 84%, 90%, 95% or 98% identity to the zebrafish 299 nucleic acid sequence of SEQ ID NO:67 over at least 89, 250, 500, 750, 1250, 1500, 1750, or 2000 nucleotides. In a desirable embodiment, “299 nucleic acid sequence” is identical to the sequence of SEQ ID NO:67. Nucleic acid molecules consisting of splice variants of 299 nucleic acid sequences, as well as 299 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 47 and 48 of SEQ ID NO:67, are also included in this definition. By a “299 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a 299 polypeptide or a zebrafish 299 polypeptide (e.g., SEQ ID NO:68), or a portion thereof, as defined above. A mutation in a 299 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 299 expression or function, including, as examples, null mutations and mutations causing truncations.

[0342] By a “994 nucleic acid sequence” is meant a nucleic acid sequence that has at least 40%, 45%, 50%, 60%, 70%, 80%, 90%, 95%, or 98% identity to the zebrafish 994 nucleic acid sequence of SEQ ID NO:69 over at least 100, 150, 200, 250, 400, 500, 750, 1000, 1250, or 1500 contiguous nucleotides. In a desirable embodiment, a “994 nucleic acid sequence” is identical to the sequence of SEQ ID NO:69. Nucleic acid molecules consisting of splice variants of 994 nucleic acid sequences, as well as 994 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 66 and 67 of SEQ ID NO:69, are also included in this definition. By a “994 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a 994 polypeptide or a zebrafish 994 polypeptide (e.g., SEQ ID NO:70), or a portion thereof, as defined above. A mutation in a 994 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 994 expression or function, including, as examples, null mutations and mutations causing truncations.

[0343] By a “1373 nucleic acid sequence” is meant a nucleic acid sequence that has at least 84%, 90%, 95% or 98% identity to the zebrafish 1373 nucleic acid sequence of SEQ ID NO:71 over at least 333, 400, 450, or 500 nucleotides. In a desirable embodiment, a a “1373 nucleic acid sequence” is identical to the sequence of SEQ ID NO:71. Nucleic acid molecules consisting of splice variants of 1373 nucleic acid sequences, as well as 1373 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 118 and 119 of SEQ ID NO:71, are also included in this definition. By a “1373 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a 1373 polypeptide or a zebrafish 1373 polypeptide (e.g., SEQ ID NO:72), or a portion thereof, as defined above. A mutation in a 1373 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 1373 expression or function, including, as examples, null mutations and mutations causing truncations.

[0344] By a “Denticleless nucleic acid sequence” is meant a nucleic acid sequence that has at least 79%, 85%, 90%, 95%, or 98% identity to the zebrafish Denticleless nucleic acid sequence of SEQ ID NO:73 over at least 181 nucleotides. In a desirable embodiment, a “Denticleless nucleic acid sequence” is identical to the sequence of SEQ ID NO:73. Nucleic acid molecules consisting of splice variants of Denticleless nucleic acid sequences, as well as Denticleless nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 307 and 308 of SEQ ID NO:73, are also included in this definition. By a “Denticleless nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a Denticleless polypeptide or a zebrafish Denticleless polypeptide (e.g., SEQ ID NO:74), or a portion thereof, as defined above. A mutation in a Denticleless nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Denticleless expression or function, including, as examples, null mutations and mutations causing truncations.

[0345] By a “Ribonucleotide Reductase Protein R2 nucleic acid sequence” is meant a nucleic acid sequence that is identical to the zebrafish Ribonucleotide Reductase Protein R2 nucleic acid sequence, for example, that of SEQ ID NO:75. In a desirable embodiment, a “Ribonucleotide Reductase Protein R2 nucleic acid sequence” is identical to the sequence of SEQ ID NO:75. Nucleic acid molecules consisting of splice variants of Ribonucleotide Reductase Protein R2 nucleic acid sequences, as well as Ribonucleotide Reductase Protein R2 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 137 of SEQ ID NO:75 (which corresponds to position 360 of GenBank Accession No. AW280665), or at 337 or 342 of AW28066 are also included in this definition. By a “Ribonucleotide Reductase Protein R2 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a Ribonucleotide Reductase Protein R2 polypeptide or a zebrafish a Ribonucleotide Reductase Protein R2 polypeptide (e.g., SEQ ID NO:76), or a portion thereof, as defined above. A mutation in a Ribonucleotide Reductase Protein R2 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Ribonucleotide Reductase Protein R2 expression or function, including, as examples, null mutations and mutations causing truncations.

[0346] By a “TCP-1 Alpha nucleic acid sequence” is meant a nucleic acid sequence that is identical to a zebrafish TCP-1 Alpha nucleic acid sequence, for example, that of SEQ ID NO:77. Nucleic acid molecules consisting of splice variants of TCP-1 Alpha nucleic acid sequences, as well as TCP-1 Alpha nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion 140 bp upstream of the coding region of between nucleotides 130 and 131 of SEQ ID NO:77, are also included in this definition. By a “TCP-1 Alpha nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a TCP-1 Alpha polypeptide or a zebrafish TCP-1 Alpha polypeptide (e.g., SEQ ID NO:78), or a portion thereof, as defined above. A mutation in a TCP-1 Alpha nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant TCP-1 Alpha expression or function, including, as examples, null mutations and mutations causing truncations.

[0347] By a “Telomeric Repeat Factor 2 nucleic acid sequence” is meant a nucleic acid sequence that has at least 31%, 40%, 50%, 60%, 75%, 85%, 90%, 95% or 98% identity to the zebrafish Telomeric Repeat Factor 2 nucleic acid sequence of SEQ ID NO:79 over at least 354, 500, 1000, 1500, 1700, 2000, or 2200 nucleotides. In a desirable embodiment, a “Telomeric Repeat Factor 2 nucleic acid sequence” is identical to the sequence of SEQ ID NO:79. Nucleic acid molecules consisting of splice variants of Telomeric Repeat Factor 2 nucleic acid sequences, as well as telomeric repeatfactor 2 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 529 and 530 of SEQ ID NO:79, are also included in this definition. By a “Telomeric Repeat Factor 2 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a Telomeric Repeat Factor 2 polypeptide or a zebrafish Telomeric Repeat Factor 2 polypeptide (e.g., SEQ ID NO:80), or a portion thereof, as defined above. A mutation in a Telomeric Repeat Factor 2 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Telomeric Repeat Factor 2 expression or function, including, as examples, null mutations and mutations causing truncations.

[0348] By a “SIL nucleic acid sequence” is meant a nucleic acid sequence that has at least 79%, 85%, 90%, 95%, or 98% identity to the zebrafish SIL nucleic acid sequence of SEQ ID NO:81 over at least 96, 200, 500, 1000, 1500, 2000, 2500, 3000, 3500, or 4000 nucleotides. In a desirable embodiment, a “SIL nucleic acid sequence” is identical to the sequence of SEQ ID NO:81. Nucleic acid molecules consisting of splice variants of SIL nucleic acid sequences, as well as SIL nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 273 and 274 of SEQ ID NO:81, are also included in this definition. By a “SIL nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a SIL polypeptide or a zebrafish SIL polypeptide (e.g., SEQ ID NO:82), or a portion thereof, as defined above. A mutation in a SIL nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant SIL expression or function, including, as examples, null mutations and mutations causing truncations.

[0349] By a “U1 Small Nuclear Ribonucleoprotein C nucleic acid sequence” is meant a nucleic acid sequence that has at least 85%, 90%, 95% or 98% identity to the zebrafish U1 Small Nuclear Ribonucleoprotein C nucleic acid sequence of SEQ ID NO:83 over at least 170, 250, 300, 400, 500, 600, 700, or 750 nucleotides. In a desirable embodiment, a “U1 Small Nuclear Ribonucleoprotein C nucleic acid sequence” is identical to the sequence of SEQ ID NO:83. Nucleic acid molecules consisting of splice variants of U1 Small Nuclear Ribonucleoprotein C nucleic acid sequences, as well as U1 Small Nuclear Ribonucleoprotein C nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 52 and 53 of SEQ ID NO:83, are also included in this definition. By a “U1 Small Nuclear Ribonucleoprotein C nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a U1 Small Nuclear Ribonucleoprotein C polypeptide or a zebrafish (e.g., SEQ ID NO:84) U1 Small Nuclear Ribonucleoprotein C polypeptide, or a portion thereof, as defined above. A mutation in a U1 Small Nuclear Ribonucleoprotein C nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant U1 Small Nuclear Ribonucleoprotein C expression or function, including, as examples, null mutations and mutations causing truncations.

[0350] By a “Ski Interacting Protein (SKIP) nucleic acid sequence” is meant a nucleic acid sequence that has at least 79%, 85%, 90%, 95% or 98% identity to the zebrafish Ski Interacting Protein (SKIP) nucleic acid sequence of SEQ ID NO:85 over at least 500, 600, 700, 812, 900, 1000, 1500, or 2000 nucleotides. In a desirable embodiment, a “Ski Interacting Protein (SKIP) nucleic acid sequence” is identical to the sequence of SEQ ID NO:85. Nucleic acid molecules consisting of splice variants of Ski Interacting Protein (SKIP) nucleic acid sequences, as well as Ski Interacting Protein (SKIP) nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion 1.2 kb upstream of the coding region of the Ski Interacting Protein (SKIP) gene, are also included in this definition. By a “Ski Interacting Protein (SKIP) nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a Ski Interacting Protein (SKIP) polypeptide or a zebrafish Ski Interacting Protein (SKIP) (e.g., SEQ ID NO:86), or a portion thereof, as defined above. A mutation in a Ski Interacting Protein (SKIP) nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Ski Interacting Protein (SKIP) expression or function, including, as examples, null mutations and mutations causing truncations.

[0351] By a “297 nucleic acid sequence” is meant a nucleic acid sequence that has at least 80%, 85%, 90%, 95% or 98% identity to the zebrafish 297 nucleic acid sequence of SEQ ID NO:87 over at least 173, 250, 300, 400, 500, or 600 nucleotides. In a desirable embodiment, a “297 nucleic acid sequence” is identical to the sequence of SED ID NO:87. Nucleic acid molecules encoded by splice variants of 297 nucleic acid sequences, as well as 297 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 74 of SEQ ID NO:87, are also included in this definition. By a “297 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a 297 polypeptide or a zebrafish 297 polypeptide (e.g., SEQ ID NO:88), or a portion thereof, as defined above. A mutation in a 297 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 297 expression or function, including, as examples, null mutations and mutations causing truncations.

[0352] By a “TCP-1 Complex Gamma Chain nucleic acid sequence” is meant a nucleic acid sequence that has at least 76%, 80%, 85%, 90% or 95% identity to the zebrafish TCP-1 Complex Gamma Chain nucleic acid sequence of SEQ ID NO:89 over at least 250, 300, 400, 500, 600, 700, 800, 900, 1000, or 1598 nucleotides. In a desirable embodiment, a “TCP-1 Complex Gamma Chain nucleic acid sequence” is identical to the sequence of SEQ ID NO:89. Nucleic acid molecules consisting of splice variants of TCP-1 Complex Gamma Chain nucleic acid sequences, as well as TCP-1 Complex Gamma Chain nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotide 75 and 76 of SEQ ID NO:89, are also included in this definition. By a “TCP-1 Complex Gamma Chain nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a TCP-1 Complex Gamma Chain polypeptide or a zebrafish TCP-1 Complex Gamma Chain polypeptide (e.g., SEQ ID NO:90), or a portion thereof, as defined above. A mutation in a TCP-1 Complex Gamma Chain nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant TCP-1 Complex Gamma Chain expression or function, including, as examples, null mutations and mutations causing truncations.

[0353] By a “Small Nuclear Ribonucleoprotein D1 nucleic acid sequence” is meant a nucleic acid sequence that has at least 88%, 90%, 95%, or 98% identity to the zebrafish Small Nuclear Ribonucleoprotein D1 nucleic acid sequence of SEQ ID NO:91 over at least 152, 250, 300, 400, 500, 600, 700, 800, 900, 1000, or 1500 nucleotides. In a desirable embodiment, a “Small Nuclear Ribonucleoprotein D1 nucleic acid sequence” is identical to the sequence of SEQ ID NO:91. Nucleic acid molecules consisting of splice variants of Small Nuclear Ribonucleoprotein D1 nucleic acid sequences, as well as Small Nuclear Ribonucleoprotein D1 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 76 and 77 of SEQ ID NO:91, are also included in this definition. By a “Small Nuclear Ribonucleoprotein D1 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a Small Nuclear Ribonucleoprotein D1polypeptide or a zebrafish Small Nuclear Ribonucleoprotein D1 polypeptide (e.g., SEQ ID NO:92), or a portion thereof, as defined above. A mutation in a Small Nuclear Ribonucleoprotein D1 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Small Nuclear Ribonucleoprotein D1 expression or function, including, as examples, null mutations and mutations causing truncations.

[0354] By a “DNA Polymerase Epsilon Subunit B nucleic acid sequence” is meant a nucleic acid sequence that has at least 70%, 75%, 80%, 85%, 90%, 95%, or 98% identity to the zebrafish DNA Polymerase Epsilon Subunit B nucleic acid sequence of SEQ ID NO:93 over at least 500, 1038, 1500, or 2000 nucleotides, or at least 79%, 85%, 90%, 95%, or 99% identity or over at least 96, 250, 300, 400, 500, 600, 700, 800, 900, 1000, 1500, or 1750 nucleotides. In a desirable embodiment, a “DNA Polymerase Epsilon Subunit B nucleic acid sequence” is identical to the sequence of SEQ ID NO:93. Nucleic acid molecules consisting of splice variants of DNA Polymerase Epsilon Subunit B nucleic acid sequences, as well as DNA Polymerase Epsilon Subunit B nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotide 1161 and 1162, or at nucleotide 929 of SEQ ID NO:93, are also included in this definition. By a “DNA Polymerase Epsilon Subunit B nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a DNA Polymerase Epsilon Subunit B polypeptide or a zebrafish DNA Polymerase Epsilon Subunit B (e.g., SEQ ID NO:94), or a portion thereof, as defined above. A mutation in a DNA Polymerase Epsilon Subunit B nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant DNA Polymerase Epsilon Subunit B expression or function, including, as examples, null mutations and mutations causing truncations.

[0355] By a “821-02 nucleic acid sequence” is meant a nucleic acid sequence that has at least 76%, 80%, 85%, 90%, 95%, or 98% identity to the zebrafish 821-02 nucleic acid sequence of SEQ ID NO:95 over at least 99, 250, 300, 400, 500, 600, 700, 800, 900, 1000, 1500, 2000, or 2500 nucleotides. In a desirable embodiment, a “821-02 nucleic acid sequence” is identical to the sequence of SEQ ID NO:95. Nucleic acid molecules consisting of splice variants of 821-02 nucleic acid sequences, as well as 821-02 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 369 and 370 or 231 and 232 of SEQ ID NO:95, are also included in this definition. By a “821-02 nucleic acid molecule” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a 821-02 polypeptide or a zebrafish 821-02 polypeptide (e.g., SEQ ID NO:96), or a portion thereof, as defined above. A mutation in a 821-02 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 821-02 expression or function, including, as examples, null mutations and mutations causing truncations.

[0356] By a “1045 nucleic acid sequence” is meant a nucleic acid sequence that has at least 75%, 80%, 85%, 90%, or 95% identity to the zebrafish 1045 nucleic acid sequence of SEQ ID NO:97 over at least 250, 573, 700, 800, 900, or 1000 nucleotides. In a desirable embodiment, a “1045 nucleic acid sequence” is identical to the sequence of SEQ ID NO:97. Nucleic acid molecules consisting of splice variants of 1045 nucleic acid sequences, as well as 1045 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 216 and 344 of SEQ ID NO:97, are also included in this definition. By a “1045 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a 1045 polypeptide or a zebrafish 1045 polypeptide (e.g., SEQ ID NO:98), or a portion thereof, as defined above. A mutation in a 1045 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 1045 expression or function, including, as examples, null mutations and mutations causing truncations.

[0357] By a “1055-1 nucleic acid sequence” is meant a nucleic acid sequence that has at least 74%, 80%, 85%, 90%, or 95% identity to the zebrafish 1055-1 nucleic acid sequence of SEQ ID NO:99 over at least 250, 552, 600, 700, 800, 900, 1000, 1500, or 2000 nucleotides. In a desirable embodiment, a “1055-1 nucleic acid sequence” is identical to the sequence of SEQ ID NO:99. Nucleic acid molecules consisting of splice variants of 1055-1 nucleic acid sequences, as well as 1055-1 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 167 and 168 of SEQ ID NO:99, are also included in this definition. By a “1055-1 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a 1055-1 polypeptide or a zebrafish 1055-1 polypeptide (e.g., SEQ ID NO:100), or a portion thereof, as defined above. A mutation in a 1055-1 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 1055-1 expression or function, including, as examples, null mutations and mutations causing truncations.

[0358] By a “Spliceosome Associated Protein 49 nucleic acid sequence” is meant a nucleic acid sequence that has at least 78%, 80%, 85%, 90%, or 95% identity to the zebrafish Spliceosome Associated Protein 49 nucleic acid sequence of SEQ ID NO:101 over at least 250, 500, 651, 700, 800, 900, 1000, or 1200 nucleotides. In a desirable embodiment, a “Spliceosome Associated Protein 49 nucleic acid sequence” is identical to the sequence of SEQ ID NO:101. Nucleic acid molecules consisting of splice variants of Spliceosome Associated Protein 49 nucleic acid sequences, as well as Spliceosome Associated Protein 49 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 53 and 54 of SEQ ID NO:101, are also included in this definition. By a “Spliceosome Associated Protein 49 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a Spliceosome Associated Protein 49 polypeptide or a zebrafish Spliceosome Associated Protein 49 polypeptide (e.g., SEQ ID NO:102), or a portion thereof, as defined above. A mutation in a Spliceosome Associated Protein 49 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Spliceosome Associated Protein 49 expression or function, including, as examples, null mutations and mutations causing truncations.

[0359] By a “DNA Replication Licensing Factor MCM7 nucleic acid sequence” is meant a nucleic acid sequence that has at least 74%, 80%, 85%, 90%, 95%, or 98% identity to the zebrafish DNA Replication Licensing Factor MCM7 nucleic acid sequence of SEQ ID NO:103 over at least 100, 200, 286, 400, 500, 600, 700, or 800 nucleotides. In a desirable embodiment, a “DNA Replication Licensing Factor MCM7 nucleic acid sequence” is identical to the sequence of SEQ ID NO:103. Nucleic acid molecules consisting of splice variants of DNA Replication Licensing Factor MCM7 nucleic acid sequences, as well as DNA Replication Licensing Factor MCM7 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 121 and 122 or at nucleotide 198 of SEQ ID NO:103, are also included in this definition. By a “DNA Replication Licensing Factor MCM7 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a DNA Replication Licensing Factor MCM7 polypeptide or a zebrafish DNA Replication Licensing Factor MCM7 polypeptide (e.g., SEQ ID NO:104), or a portion thereof, as defined above. A mutation in a DNA Replication Licensing Factor MCM7 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant DNA Replication Licensing Factor MCM7 expression or function, including, as examples, null mutations and mutations causing truncations.

[0360] By a “Dead-Box RNA Helicase (DEAD5 or DEAD19) nucleic acid sequence” is meant a nucleic acid sequence that has at least 81%, 85%, 90%, 95%, or 98% identity to the zebrafish DNA Dead-Box RNA Helicase (DEAD5 or DEAD19) nucleic acid sequence of SEQ ID NO:105 over at least 250, 300, 400, 500, 600, 700, 800, 810, 900, 1000, 1500, or 1750 nucleotides. In a desirable embodiment, a “Dead-Box RNA Helicase (DEAD5 or DEAD19) nucleic acid sequence” is identical to the sequence of SEQ ID NO:105. Nucleic acid molecules encoded by splice variants of Dead-Box RNA Helicase (DEAD5 or DEAD19) nucleic acid sequences, as well as Dead-Box RNA Helicase (DEAD5 or DEAD19) nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 132 of SEQ ID NO:105, are also included in this definition. By a “Dead-Box RNA Helicase (DEAD5 or DEAD19) nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a Dead-Box RNA Helicase (DEAD5 or DEAD19) polypeptide or a zebrafish Dead-Box RNA Helicase (DEAD5 or DEAD19) polypeptide (e.g., SEQ ID NO:106), or a portion thereof, as defined above. A mutation in a Dead-Box RNA Helicase (DEAD5 or DEAD19) nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Dead-Box RNA Helicase (DEAD5 or DEAD19) expression or function, including, as examples, null mutations and mutations causing truncations.

[0361] By a “1581 nucleic acid sequence” is meant a nucleic acid sequence that has at least 77%, 80%, 85%, 90%, 95%, or 98% identity to the zebrafish 1581 nucleic acid sequence of SEQ ID NO:107 over at least 165, 250, 300, 400, 500, 600, 700, 800, 900, 1000, or 1300 nucleotides. In a desirable embodiment, a “1581 nucleic acid sequence” is identical to the sequence of SEQ ID NO:107. Nucleic acid molecules consisting of splice variants of 1581 nucleic acid sequences, as well as 1581 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 346 and 347 of SEQ ID NO:107, are also included in this definition. By a “1581 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a 1581 polypeptide or a zebrafish 1581 polypeptide (e.g., SEQ ID NO:108), or a portion thereof, as defined above. A mutation in a 1581 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 1581 expression or function, including, as examples, null mutations and mutations causing truncations.

[0362] By a “Cyclin A2 nucleic acid sequence” is meant a nucleic acid sequence that is, for example, identical to the zebrafish Cyclin A2 nucleic acid sequence of SEQ ID NO: 109. Nucleic acid molecules consisting of splice variants of Cyclin A2 nucleic acid sequences, as well as Cyclin A2 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 374 or 401 of SEQ ID NO:109, are also included in this definition. By a “Cyclin A2 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes a Cyclin A2 polypeptide or a zebrafish (e.g., SEQ ID NO:110) Cyclin A2 polypeptide, or a portion thereof, as defined above. A mutation in a Cyclin A2 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Cyclin A2 expression or function, including, as examples, null mutations and mutations causing truncations.

[0363] By an “Imitation Switch (ISWI)/SNF2 nucleic acid sequence” or “ISWI/SNF2 nucleic acid sequence” is meant a nucleic acid sequence that has at least 84%, 90%, 95%, or 98% identity to the zebrafish Imitation Switch (ISWI)/SNF2 nucleic acid sequence of SEQ ID NO:11 over at least 196, 250, 300, 400, 500, 600, 700, 800, 900, 1000, or 1100 nucleotides. In a desirable embodiment, an “Imitation Switch (ISWI)/SNF2 nucleic acid sequence” is identical to the sequence of SEQ ID NO:111. Nucleic acid molecules consisting of splice variants of Imitation Switch (ISWI)/SNF2 nucleic acid sequences, as well as Imitation Switch (ISWI)/SNF2 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 76 of SEQ ID NO:111, are also included in this definition. By an “Imitation Switch (ISWI)/SNF2 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes an Imitation Switch (ISWI)/SNF2 polypeptide or a zebrafish Imitation Switch (ISWI)/SNF2 polypeptide (e.g., SEQ ID NO:112), or a portion thereof, as defined above. A mutation in an Imitation Switch (ISWI)/SNF2 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Imitation Switch (ISWI)/SNF2 expression or function, including, as examples, null mutations and mutations causing truncations.

[0364] By a “Chromosomal Assembly Protein C (XCAP-C) nucleic acid sequence” or “XCAP-C nucleic acid sequence” is meant a nucleic acid sequence that has at least 78%, 85%, 90%, 95%, or 98% identity to the zebrafish Chromosomal Assembly Protein C (XCAP-C) nucleic acid sequence of SEQ ID NO:113 over at least 250, 500, 765, 800, 900, 1000, 1250, 1500, 2000, 2500, 3000, 3500, or 4000 nucleotides. In a desirable embodiment, a “Chromosomal Assembly Protein C (XCAP-C) nucleic acid sequence” is identical to the sequence of SEQ ID NO:113. Nucleic acid molecules consisting of splice variants of Chromosomal Assembly Protein C (XCAP-C) nucleic acid sequences, as well as Chromosomal Assembly Protein C (XCAP-C) nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 181 and 182 of SEQ ID NO:113, are also included in this definition. By a “Chromosomal Assembly Protein C (XCAP-C) nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes Chromosomal Assembly Protein C (XCAP-C) polypeptide or a zebrafish (e.g., SEQ ID NO:114) Chromosomal Assembly Protein C (XCAP-C) polypeptide, or a portion thereof, as defined above. A mutation in a Chromosomal Assembly Protein C (XCAP-C) nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Chromosomal Assembly Protein C (XCAP-C) expression or function, including, as examples, null mutations and mutations causing truncations.

[0365] By a “DNA Replication Licensing Factor MCM2 nucleic acid sequence” is meant a nucleic acid sequence that has at least 78%, 85%, 90%, 95%, or 98% identity to the zebrafish DNA Replication Licensing Factor MCM2 nucleic acid sequence of SEQ ID NO:115 over at least 500, 1000, 1164, 1500, or 2000 nucleotides. In a desirable embodiment, a “DNA Replication Licensing Factor MCM2 nucleic acid sequence” is identical to the sequence of SEQ ID NO:115. Nucleic acid molecules consisting of splice variants of DNA Replication Licensing Factor MCM2 nucleic acid sequences, as well as DNA Replication Licensing Factor MCM2 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion in the intron preceding nucleotide 399 of SEQ ID NO:115, are also included in this definition. By a “DNA Replication Licensing Factor MCM2 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes DNA Replication Licensing Factor MCM2 polypeptide or a zebrafish DNA Replication Licensing Factor MCM2 polypeptide (e.g., SEQ ID NO:116), or a portion thereof, as defined above. A mutation in a DNA Replication Licensing Factor MCM2 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant DNA Replication Licensing Factor MCM2 expression or function, including, as examples, null mutations and mutations causing truncations.

[0366] By a “DNA Replication Licensing Factor MCM3 nucleic acid sequence” is meant a nucleic acid sequence that has at least 78%, 85%, 90%, or 95% identity to the zebrafish DNA Replication Licensing Factor MCM3 nucleic acid sequence of SEQ ID NO:117 over at least 250, 400, 500, 574, or 600 nucleotides. In a desirable embodiment, a “DNA Replication Licensing Factor MCM3 nucleic acid sequence” is identical to the sequence of SEQ ID NO:117. Nucleic acid molecules consisting of splice variants of DNA Replication Licensing Factor MCM3 nucleic acid sequences, as well as DNA Replication Licensing Factor MCM3 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 50 or between nucleotides 75 and 76 of SEQ ID NO:117, are also included in this definition. By a “DNA Replication Licensing Factor MCM3 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes DNA Replication Licensing Factor MCM3 polypeptide or a zebrafish DNA Replication Licensing Factor MCM3 polypeptide (e.g., SEQ ID NO:118), or a portion thereof, as defined above. A mutation in a DNA Replication Licensing Factor MCM3 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant DNA Replication Licensing Factor MCM3 expression or function, including, as examples, null mutations and mutations causing truncations.

[0367] By a “Valyl-tRNA Synthase nucleic acid sequence” is meant a nucleic acid sequence that has at least 76%, 80%, 85%, 90%, 95%, or 98% identity to the zebrafish Valyl-tRNA Synthase nucleic acid sequence of SEQ ID NO:119 over at least 519, 550, 600, 650, 750, 1000, 1500, 2000, 2500, or 2900 contiguous nucleotides. In a desirable embodiment, a “Valyl-tRNA Synthase nucleic acid sequence” is identical to the sequence of SEQ ID NO:119. Nucleic acid molecules consisting of splice variants of Valyl-tRNA Synthase nucleic acid sequences, as well as Valyl-tRNA Synthase nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 30 and 31 of the nucleic acid sequence of SEQ ID NO:119, are also included in this definition. By a “Valyl-tRNA Synthase nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes Valyl-tRNA Synthase polypeptide or a zebrafish Valyl-tRNA Synthase polypeptide (e.g., SEQ ID NO:120), or a portion thereof, as defined above. A mutation in a Valyl-tRNA Synthase nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Valyl-tRNA Synthase expression or function, including, as examples, null mutations and mutations causing truncations.

[0368] By a “40S Ribosomal Protein S5 nucleic acid sequence” is meant a nucleic acid sequence that has at least 84%, 90%, 95%, or 98% identity to the zebrafish 40S Ribosomal Protein S5 nucleic acid sequence of SEQ ID NO:121 over at least 593 or 645 nucleotides. In a desirable embodiment, a “40S Ribosomal Protein S5 nucleic acid sequence” is identical to the sequence of SEQ ID NO:121. Nucleic acid molecules consisting of splice variants of 40S Ribosomal Protein S5 nucleic acid sequences, as well as 40S Ribosomal Protein S5 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 31 and 32 of SEQ ID NO:121, are also included in this definition. By a “40S Ribosomal Protein S5 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes 40S Ribosomal Protein S5 polypeptide or a zebrafish 40S Ribosomal Protein S5 polypeptide (e.g., SEQ ID NO:122), or a portion thereof, as defined above. A mutation in a 40S Ribosomal Protein S5 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 40S Ribosomal Protein S5 expression or function, including, as examples, null mutations and mutations causing truncations.

[0369] By a “TCP-1 Beta nucleic acid sequence” is meant a nucleic acid sequence that has at least 74%, 80%, 85%, 90%, 95%, or 98% identity to the zebrafish TCP-1 Beta nucleic acid sequence of SEQ ID NO:123 over at least 500, 750, 1000, or 1100 nucleotides. In a desirable embodiment, a “TCP-1 Beta nucleic acid sequence” is identical to the sequence of SEQ ID NO:123. Nucleic acid molecules consisting of splice variants of TCP-1 Beta nucleic acid sequences, as well as TCP-1 Beta nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 63 and 64 of SEQ ID NO:123, are also included in this definition. By a “TCP-1 Beta nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes TCP-1 Beta polypeptide or a zebrafish TCP-1 Beta polypeptide (e.g., SEQ ID NO:124), or a portion thereof, as defined above. A mutation in a TCP-1 Beta nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant TCP-1 Beta expression or function, including, as examples, null mutations and mutations causing truncations.

[0370] By a “TCP-1 Eta nucleic acid sequence” is meant a nucleic acid sequence that has at least 79%, 85%, 90%, 95%, or 98% identity to the zebrafish TCP-1 Eta nucleic acid sequence of SEQ ID NO:125 over at least 1584, 1750, or 2000 nucleotides. In a desirable embodiment, a “TCP-1 Eta nucleic acid sequence” is identical to the sequence of SEQ ID NO:125. Nucleic acid molecules consisting of splice variants of TCP-1 Eta nucleic acid sequences, as well as TCP-1 Eta nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 32 and 33 of SEQ ID NO:125, are also included in this definition. By a “TCP-1 Eta nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes TCP-1 Eta polypeptide or a zebrafish TCP-1 Eta polypeptide (e.g., SEQ ID NO:126), or a portion thereof, as defined above. A mutation in a TCP-1 Eta nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant TCP-1 Eta expression or function, including, as examples, null mutations and mutations causing truncations.

[0371] By a “Translation Elongation Factor eEF1 Alpha nucleic acid sequence” is meant a nucleic acid sequence that is identical to a zebrafish Translation Elongation Factor eEF1 Alpha nucleic acid sequence, for example, the nucleic acid sequence of SEQ ID NO:127. Nucleic acid molecules consisting of splice variants of Translation Elongation Factor eEF1 Alpha nucleic acid sequences, as well as Translation Elongation Factor eEF1 Alpha nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 60 and 61 of SEQ ID NO:127, are also included in this definition. By a “Translation Elongation Factor eEF1 Alpha nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes Translation Elongation Factor eEF1 Alpha polypeptide or a zebrafish Translation Elongation Factor eEF1 Alpha polypeptide (e.g., SEQ ID NO:128), or a portion thereof, as defined above. A mutation in a Translation Elongation Factor eEF1 Alpha nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Translation Elongation Factor eEF1 Alpha expression or function, including, as examples, null mutations and mutations causing truncations.

[0372] By a “1257 nucleic acid sequence” is meant a nucleic acid sequence that has at least 40%, 45%, 50%, 55%, 60%, 70%, 80%, 90%, 95%, or 98% identity to the zebrafish 1257 nucleic acid sequence of SEQ ID NO:129 over at least 100, 150, 200, 250, 300, 400, 500, 600, 750, 1000, 1250, 1500, 1750, or 1800 contiguous nucleotides. In a desirable embodiment, a “1257 nucleic acid sequence” is identical to the sequence of SEQ ID NO:129. Nucleic acid molecules consisting of splice variants of 1257 nucleic acid sequences, as well as 1257 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 175 of SEQ ID NO:129, are also included in this definition. By a “1257 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes 1257 polypeptide or a zebrafish 1257 polypeptide (e.g., SEQ ID NO:130), or a portion thereof, as defined above. A mutation in a 1257 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 1257 expression or function, including, as examples, null mutations and mutations causing truncations.

[0373] By a “60S Ribosomal Protein L24 nucleic acid sequence” is meant a nucleic acid sequence that has at least 82%, 85%, 90%, 95%, or 98% identity to the zebrafish 60S Ribosomal Protein L24 nucleic acid sequence of SEQ ID NO:131 over at least 250, 363, 400, 500, or 565 nucleotides. In a desirable embodiment, a “60S Ribosomal Protein L24 nucleic acid sequence” is identical to the sequence of SEQ ID NO:131. Nucleic acid molecules consisting of splice variants of 60S Ribosomal Protein L24 nucleic acid sequences, as well as 60S Ribosomal Protein L24 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 144 and 145 of the nucleic acid sequence of SEQ ID NO:131, are also included in this definition. By a “60S Ribosomal Protein L24 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes 60S Ribosomal Protein L24 polypeptide or a zebrafish 60S Ribosomal Protein L24 polypeptide (e.g., SEQ ID NO:132), or a portion thereof, as defined above. A mutation in a 60S Ribosomal Protein L24 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 60S Ribosomal Protein L24 expression or function, including, as examples, null mutations and mutations causing truncations.

[0374] By a “Non-Muscle Adenylosuccinate Synthase nucleic acid sequence” is meant a nucleic acid sequence that has at least 76%, 80%, 85%, 90%, 95%, or 98% identity to the zebrafish Non-Muscle Adenylosuccinate Synthase nucleic acid sequence of SEQ ID NO:133 over at least 250, 333, 400, 500, 600, 700, 800, 900, 1000, 1250, 1500, or 1700 nucleotides. In a desirable embodiment, a “Non-Muscle Adenylosuccinate Synthase nucleic acid sequence” is identical to the sequence of SEQ ID NO:133. Nucleic acid molecules consisting of splice variants of Non-Muscle Adenylosuccinate Synthase nucleic acid sequences, as well as Non-Muscle Adenylosuccinate Synthase nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 209 or between nucleotides 217 and 218 of SEQ ID NO:133, are also included in this definition. By a “Non-Muscle Adenylosuccinate Synthase nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes Non-Muscle Adenylosuccinate Synthase polypeptide or a zebrafish Non-Muscle Adenylosuccinate Synthase polypeptide (e.g., SEQ ID NO:134), or a portion thereof, as defined above. A mutation in a Non-Muscle Adenylosuccinate Synthase nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Non-Muscle Adenylosuccinate Synthase expression or function, including, as examples, null mutations and mutations causing truncations.

[0375] By a “Nuclear Cap Binding Protein Subunit 2 nucleic acid sequence” is meant a nucleic acid sequence that has at least 74%, 80%, 85%, 90%, 95%, or 98% identity to the zebrafish Nuclear Cap Binding Protein Subunit 2 nucleic acid sequence of SEQ ID NO:135 over at least 390, 500, 600, 700, or 740 nucleotides. In a desirable embodiment, a “Nuclear Cap Binding Protein Subunit 2 nucleic acid sequence” is identical to the sequence of SEQ ID NO:135. Nucleic acid molecules consisting of splice variants of Nuclear Cap Binding Protein Subunit 2 nucleic acid sequences, as well as Nuclear Cap Binding Protein Subunit 2 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 137 and 138 of SEQ ID NO:135, are also included in this definition. By a “Nuclear Cap Binding Protein Subunit 2 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes Nuclear Cap Binding Protein Subunit 2 polypeptide or a zebrafish Nuclear Cap Binding Protein Subunit 2 polypeptide (e.g., SEQ ID NO:136), or a portion thereof, as defined above. A mutation in a Nuclear Cap Binding Protein Subunit 2 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Nuclear Cap Binding Protein Subunit 2 expression or function, including, as examples, null mutations and mutations causing truncations.

[0376] By a “Ornithine Decarboxylase nucleic acid sequence” is meant a nucleic acid sequence that is identical to a zebrafish Ornithine Decarboxylase nucleic acid sequence, for example, that of SEQ ID NO:137. Nucleic acid molecules consisting of splice variants of Ornithine Decarboxylase nucleic acid sequences, as well as Ornithine Decarboxylase nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 97 and 98 of SEQ ID NO:137, are also included in this definition. By a “Ornithine Decarboxylase nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes Ornithine Decarboxylase polypeptide or a zebrafish Ornithine Decarboxylase polypeptide (e.g., SEQ ID NO:138), or a portion thereof, as defined above. A mutation in an Ornithine Decarboxylase nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Ornithine Decarboxylase expression or function, including, as examples, null mutations and mutations causing truncations.

[0377] By a “Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS) nucleic acid sequence” is meant a nucleic acid sequence that has at least 76%, 80%, 85%, 90%, 95%, or 98% identity to the zebrafish Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS) nucleic acid sequence of SEQ ID NO:139 over at least 240, 300, 400, 500, 600, 700, 800, 900, 1000, 1500, or 2000 nucleotides. In a desirable embodiment, a “Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS) nucleic acid sequence” is identical to the sequence of SEQ ID NO:139. Nucleic acid molecules consisting of splice variants of Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS) nucleic acid sequences, as well as Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS) nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 303 of SEQ ID NO:139, are also included in this definition. By a “Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS) nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS) polypeptide or a zebrafish Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS) polypeptide (e.g., SEQ ID NO:140), or a portion thereof, as defined above. A mutation in a Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS) nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS) expression or function, including, as examples, null mutations and mutations causing truncations.

[0378] By a “Mitochondrial Inner Membrane Translocating Protein (rTIM23) nucleic acid sequence” is meant a nucleic acid sequence that has at least 76%, 80%, 85%, 90%, 95%, or 98% identity to the zebrafish Mitochondrial Inner Membrane Translocating Protein (rTIM23) nucleic acid sequence of SEQ ID NO:141 over at least 250, 416, 500, 600, 700, 800, 900, 1000, or 1250 nucleic acids. In a desirable embodiment, a “Mitochondrial Inner Membrane Translocating Protein (rTIM23) nucleic acid sequence” is identical to the sequence of SEQ ID NO:141. Nucleic acid molecules consisting of splice variants of Mitochondrial Inner Membrane Translocating Protein (rTIM23) nucleic acid sequences, as well as Mitochondrial Inner Membrane Translocating Protein (rTIM23) nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 100 of SEQ ID NO:141, are also included in this definition. By a “Mitochondrial Inner Membrane Translocating Protein (rTIM23) nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes Mitochondrial Inner Membrane Translocating Protein (rTIM23) polypeptide or a zebrafish Mitochondrial Inner Membrane Translocating Protein (rTIM23) polypeptide (e.g., SEQ ID NO:142), or a portion thereof, as defined above. A mutation in a Mitochondrial Inner Membrane Translocating Protein (rTIM23) nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Mitochondrial Inner Membrane Translocating Protein (rTIM23) expression or function, including, as examples, null mutations and mutations causing truncations.

[0379] By a “1447 nucleic acid sequence” is meant a nucleic acid sequence that has at least 76%, 80%, 85%, 90%, 95%, or 98% identity to the zebrafish 1447 nucleic acid sequence of SEQ ID NO:143 over at least 500, 750, 910, 1000, 1250, 1500, 2000, 2500, or 2800 nucleic acids. In a desirable embodiment, a “1447 nucleic acid sequence” is identical to the sequence of SEQ ID NO:143. Nucleic acid molecules consisting of splice variants of 1447 nucleic acid sequences, as well as 1447 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 227 and 228 of SEQ ID NO:143, are also included in this definition. By a “1447 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes 1447 polypeptide or a zebrafish 1447 polypeptide (e.g., SEQ ID NO:144), or a portion thereof, as defined above. A mutation in a 1447 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant 1447 expression or function, including, as examples, null mutations and mutations causing truncations.

[0380] By an “ARS2 nucleic acid sequence” is meant a nucleic acid sequence that has at least 75%, 80%, 85%, 90%, 95%, or 98% identity to the zebrafish ARS2 nucleic acid sequence of SEQ ID NO:145 over at least 250, 500, 614, 750, 1000, 1250, 1500, 2000, or 2400 nucleic acids. In a desirable embodiment, an “ARS2 nucleic acid sequence” is identical to the sequence of SEQ ID NO:145. Nucleic acid molecules consisting of splice variants of ARS2 nucleic acid sequences, as well as ARS2 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 103 and 104 of SEQ ID NO:143, are also included in this definition. By an “ARS2 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes ARS2 polypeptide or a zebrafish ARS2 polypeptide (e.g., SEQ ID NO:146), or a portion thereof, as defined above. A mutation in an ARS2 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant ARS2 expression or function, including, as examples, null mutations and mutations causing truncations.

[0381] By a “Sec61 Alpha nucleic acid sequence” is meant a nucleic acid sequence that is identical to a zebrafish Sec61 Alpha nucleic acid sequence, for example, that of SEQ ID NO:147. Nucleic acid molecules consisting of splice variants of Sec61 Alpha nucleic acid sequences, as well as Sec61 Alpha nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 132 and 133 of SEQ ID NO:147, are also included in this definition. By a “Sec61 Alpha nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes Sec61 alpha polypeptide or a zebrafish Sec61 alpha polypeptide (e.g., SEQ ID NO:148), or a portion thereof, as defined above. A mutation in a Sec61 Alpha nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Sec61 Alpha expression or function, including, as examples, null mutations and mutations causing truncations.

[0382] By a “BAF53a nucleic acid sequence” is meant a nucleic acid sequence that has at least 77%, 85%, 90%, 95%, or 98% identity to the zebrafish BAF53a nucleic acid sequence of SEQ ID NO:149 over at least 500, 750, 1000, 1288, 1500, or 1800 nucleic acids. In a desirable embodiment, a “BAF53a nucleic acid sequence” is identical to the sequence of SEQ ID NO:149. Nucleic acid molecules consisting of splice variants of BAF53a nucleic acid sequences, as well as BAF53a nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 160 of SEQ ID NO:149, are also included in this definition. By a “BAF53a nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes BAF53a polypeptide or a zebrafish BAF53a polypeptide (e.g., SEQ ID NO:150), or a portion thereof, as defined above. A mutation in a BAF53a nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant BAF53a expression or function, including, as examples, null mutations and mutations causing truncations.

[0383] By a “Histone Deacetylase nucleic acid sequence” is meant a nucleic acid sequence that has at least 81%, 85%, 90%, 95%, or 98% identity to the zebrafish Histone Deacetylase nucleic acid sequence of SEQ ID NO:151 over at least 500, 750, 1000, 1250, 1406, 1500, or 2000 nucleic acids. In a desirable embodiment, a “Histone Deacetylase nucleic acid sequence” is identical to the sequence of SEQ ID NO:151. Nucleic acid molecules consiting of splice variants of Histone Deacetylase nucleic acid sequences, as well as Histone Deacetylase nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion at nucleotide 88 or between nucleotides 98 and 99 of SEQ ID NO:151, are also included in this definition. By a “Histone Deacetylase nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes Histone Deacetylase polypeptide or a zebrafish Histone Deacetylase polypeptide (e.g., SEQ ID NO:152), or a portion thereof, as defined above. A mutation in a Histone Deacetylase nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Histone Deacetylase expression or function, including, as examples, null mutations and mutations causing truncations.

[0384] By a “Fibroblast Isoform of the ADP/ATP Carrier Protein nucleic acid sequence” is meant a nucleic acid sequence that has at least 83%, 87%, 90%, 95%, or 98% identity to the zebrafish Fibroblast Isoform of the ADP/ATP Carrier Protein nucleic acid sequence of SEQ ID NO:153 over at least 500, 750, 886, 1000, or 1200 nucleic acids. In a desirable embodiment, a “Fibroblast Isoform of the ADP/ATP Carrier Protein nucleic acid sequence” is identical to the sequence of SEQ ID NO:153. Nucleic acid molecules consisting of splice variants of Fibroblast Isoform of the ADP/ATP Carrier Protein nucleic acid sequences, as well as Fibroblast Isoform of the ADP/ATP Carrier Protein nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 178 and 179 of SEQ ID NO:153, are also included in this definition. By a “Fibroblast Isoform of the ADP/ATP Carrier Protein nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes Fibroblast Isoform of the ADP/ATP Carrier Protein polypeptide or a zebrafish Fibroblast Isoform of the ADP/ATP Carrier Protein polypeptide (e.g., SEQ ID NO:154), or a portion thereof, as defined above. A mutation in a Fibroblast Isoform of the ADP/ATP Carrier Protein nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant Fibroblast Isoform of the ADP/ATP Carrier Protein expression or function, including, as examples, null mutations and mutations causing truncations.

[0385] By a “TAFII-55 nucleic acid sequence” is meant a nucleic acid sequence that has at least 75%, 80%, 85%, 90%, 95%, or 98% identity to the zebrafish TAFII-55 nucleic acid sequence of SEQ ID NO:155 over at least 250, 559, 750, 900, 1000, 1250, or 1400 nucleic acids. In a desirable embodiment, a “TAFII-55 nucleic acid sequence” is identical to the sequence of SEQ ID NO:155. Nucleic acid molecules consisting of splice variants of TAFII-55 nucleic acid sequences, as well as TAFII-55 nucleic acid sequences containing a mutation, for example, a mutation resulting from the insertion of a virus, e.g., an insertion between nucleotides 107 and 108 of SEQ ID NO:155, are also included in this definition. By a “TAFII-55 nucleic acid sequence” is also meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA (e.g., mRNA) molecule, that encodes TAFII-55 polypeptide or a zebrafish TAFII-55 polypeptide (e.g., SEQ ID NO:154), or a portion thereof, as defined above. A mutation in a TAFII-55 nucleic acid molecule can be characterized, for example, by the insertion of a retrovirus into the nucleic acid sequence, for example, using methods described herein. In addition, the invention includes mutations that result in aberrant TAFII-55 expression or function, including, as examples, null mutations and mutations causing truncations.

[0386] By an “alteration” or a “mutation” in reference to a nucleic acid sequence is meant a change in the nucleic acid sequence relative to that of a wild-type sequence. Such a change may include, for example, a substitution of one nucleotide for another, an inversion, a deletion or insertion of one or more nucleic acids, or a duplication of one or more nucleic acids. In reference to an amino acid sequence, an “alteration”, or “mutation” includes a change in the amino acid sequence relative to that of a wild-type organism. Such a change in an amino acid sequence may be, for example, a substitution of one amino acid for another, a deletion or insertion of one or more amino acids, or a duplication of one or more amino acids.

[0387] By an “alteration in level,” in reference to a nucleic acid molecule or an amino acid molecule, for example, a 904, POU2, 40S Ribosomal Protein S18, U2AF, 954, Nrp-1, Cad-1, V-ATPase alpha subunit, V-ATPase SFD subunit, V-ATPase 16 kDa proteolytic subunit, 1463, VPSP18, Pescadillo, HNF1-β/vHNF1, 60S Ribosomal Protein L35, 60S Ribosomal Protein L44, CopZ1, 215, 307, 572, 1116A, 1548, Casein Kinase 1 α, Nodal-Related or Squint, Smoothened, 429, 428, Spinster, Glypican-6 or Knypek, Ribonucleotide Reductase Protein R1 Class 1, Kinesin-Related Motor Protein EG5, 459, Wnt5 or Pipetail, Aryl Hydrocarbon Receptor Nuclear Translocator 2A, Vesicular Integral-Membrane Protein VIP 36, 299, 994, 1373, Denticleless, Ribonucleotide Reductase Protein R2, TCP-1 Alpha, Telomeric Repeat Factor 2, SIL, U1 Small Nuclear Ribonucleoprotein C, Ski Interacting Protein, 297, TCP-1 Complex Gamma Chain, Small Nuclear Ribonucleoprotein D1, DNA Polymerase Epsilon Subunit B, 821-02, 1045, 1055-01, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, DEAD-Box RNA Helicase (DEAD5 or DEAD19), 1581, Cyclin A2, ISWI/SNF2, Chromosomal Assembly Protein C (XCAP-C), DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Valyl-tRNA Synthase, 40S Ribosomal Protein S5, TCP-1 Beta, TCP-1 Eta, Translation Elongation Factor eEF1 Alpha, 1257, 60S Ribosomal Protein L24, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, Ornithine Decarboxylase, Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS), Mitochondrial Inner Membrane Translocating Protein (rTIM23), 1447, ARS2, Sec61 Alpha, BAF53a, Histone Deacetylase, Fibroblast Isoform of the ADP/ATP Carrier Protein, or a TAFII-55 polypeptide or nucleic acid sequence is meant a change, such as an increase or a decrease in expression level of an endogenous (e.g., a naturally-occuring sequence) or heterologous nucleic acid or amino acid sequence.

[0388] Desirably, such an increase or decrease in the expression of a nucleic acid or amino acid sequence is, for example, at least 20%, 40%, 50%, 60%, 70%, or 80%. In more desirable embodiments, the decrease or increase may be, for example, 90%, 95%, or even 100%. Thus, a decrease may be a complete lack of expression of a nucleic acid or amino acid sequence. Further, an increase in the expression of a nucleic acid or amino acid sequence may be, for example, 2-fold, 3-fold, 5-fold, or even 10-fold. For instance, one can detect an alteration in the level of a nucleic acid by amplifying the sequence, or part thereof, using standard techniques such as quantitative Polymerase Chain Reaction (PCR) analysis, hybridization analysis, gel electrophoresis, Northern blots, Southern blots, and spectrophotometric assays. Alternatively, an alteration in the level of an amino acid sequence may be detected, for example, by using an antibody specific for this amino acid sequence and performing a Western blot. In addition, amino acid levels may be detected using Bradford assays and spectrophotometric assays.

[0389] By an “alteration in sequence,” in reference to a nucleic acid or amino acid sequence, for example, a 904, POU2, 40S Ribosomal Protein S18, U2AF, 954, Nrp-1, Cad-1, V-ATPase alpha subunit, V-ATPase SFD subunit, V-ATPase 16 kDa proteolytic subunit, 1463, VPSP18, Pescadillo, HNF1-β/vHNF1, 60S Ribosomal Protein L35, 60S Ribosomal Protein L44, CopZ1, 215, 307, 572, 1116A, 1548, Casein Kinase 1 α, Nodal-Related or Squint, Smoothened, 429, 428, Spinster, Glypican-6 or Knypek, Ribonucleotide Reductase Protein R1 Class 1, Kinesin-Related Motor Protein EG5, 459, Wnt5 or Pipetail, Aryl Hydrocarbon Receptor Nuclear Translocator 2A, Vesicular Integral-Membrane Protein VIP 36, 299, 994, 1373, Denticleless, Ribonucleotide Reductase Protein R2, TCP-1 Alpha, Telomeric Repeat Factor 2, SIL, U1 Small Nuclear Ribonucleoprotein C, Ski Interacting Protein, 297, TCP-1 Complex Gamma Chain, Small Nuclear Ribonucleoprotein D1, DNA Polymerase Epsilon Subunit B, 821-02, 1045, 1055-01, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, DEAD-Box RNA Helicase (DEAD5 or DEAD19), 1581, Cyclin A2, ISWI/SNF2, Chromosomal Assembly Protein C (XCAP-C), DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Valyl-tRNA Synthase, 40S Ribosomal Protein S5, TCP-1 Beta, TCP-1 Eta, Translation Elongation Factor eEF1 Alpha, 1257, 60S Ribosomal Protein L24, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, Ornithine Decarboxylase, Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS), Mitochondrial Inner Membrane Translocating Protein (rTIM23), 1447, ARS2, Sec61 Alpha, BAF53a, Histone Deacetylase, Fibroblast Isoform of the ADP/ATP Carrier Protein, or a TAFII-55 amino acid or nucleic acid sequence is meant a change, for example, one resulting from a mutation in an endogenous (e.g., a naturally-occuring sequence) nucleic acid sequence. For example, one can detect an alteration in a nucleic acid sequence, or part thereof, by Restriction Fragment Length Polymorphism (“RFLP”) analysis and by amplifying the sequence, or a fragment thereof, using standard techniques such as the Polymerase Chain Reaction (“PCR”) and determining its sequence using standard DNA sequencing protocols. In addition, an alteration in an amino acid sequence may be detected, for example, using standard peptide sequencing protocols.

[0390] By “anti-sense,” as used herein in reference to a nucleic acid sequence, is meant a nucleic acid sequence, regardless of length, that is complementary to the coding strand or mRNA of a nucleic acid sequence. Desirably the anti-sense nucleic acid is capable of decreasing the expression or biological activity of a nucleic acid or amino acid sequence. In a desirable embodiment, the decrease in expression or biological activity is at least 10%, relative to a control, more desirably 25%, and most desirably 50% or more. The anti-sense nucleic acid may contain a modified backbone, for example, phosphorothioate, phosphorodithioate, or other modified backbones known in the art, or may contain non-natural internucleoside linkages.

[0391] By “biological activity” is meant any activity that is caused by a nucleic acid or amino acid sequence either in vivo or in vitro. For example, the biological activity of a 459 amino acid or nucleic acid sequence may be regulation of kidney development or function or regulation of cell proliferation.

[0392] By a “candidate compound” or “test compound” is meant a chemical, be it naturally-occurring or artificially-derived, that is surveyed for its ability to modulate the biological activity of a nucleic acid or amino acid molecule, by employing one of the assay methods described herein. Candidate compounds may include, for example, peptides, polypeptides, synthetic organic molecules, naturally-occurring organic molecules, nucleic acid molecules, and components thereof.

[0393] By “high stringency conditions” is meant conditions that allow hybridization comparable with the hybridization that occurs using a DNA probe of at least 300, 400, or 500 nucleotides in length, in a buffer containing, for example, 0.5 M NaHPO₄, pH 7.2, 7% SDS, 1 mM EDTA, and 1% BSA (fraction V), at a temperature of about 65° C., or a buffer containing, for example, 48% formamide, 4.8×SSC, 0.2 M Tris-Cl, pH 7.6, 1× Denhardt's solution, 10% dextran sulfate, and 0.1% SDS, at a temperature of about 42° C. (These are typical conditions for high stringency Northern or Southern hybridizations.) High stringency hybridization may also be used in numerous techniques routinely performed by molecular biologists, such as high stringency PCR, DNA sequencing, single strand conformational polymorphism analysis, and in situ hybridization. In contrast to Northern and Southern hybridizations, these techniques are usually performed with relatively short probes (e.g., usually 16 nucleotides or longer for PCR or sequencing, and 40 nucleotides or longer for in situ hybridization). The high stringency conditions used in these techniques are well known to those skilled in the art of molecular biology, and examples of them can be found, for example, in Ausubel et al. (Current Protocols in Molecular Biology, John Wiley & Sons, New York, 2000), which is hereby incorporated by reference.

[0394] The term “identity” is used herein to describe the relationship of the sequence of a particular nucleic acid molecule or polypeptide to the sequence of a reference molecule of the same type. For example, if a polypeptide or a nucleic acid molecule has the same amino acid or nucleotide residue at a given position, compared to a reference molecule to which it is aligned, there is said to be “identity” at that position. The level of sequence identity of a nucleic acid molecule or a polypeptide to a reference molecule is typically measured using sequence analysis software with the default parameters specified therein, such as the introduction of gaps to achieve an optimal alignment (e.g., EditSeq™ or MegAlign™ (DNASTAR, Inc. 1993-2001), Sequence Analysis Software Package of the Genetics Computer Group, University of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, Wis. 53705, BLAST, or PILEUP/PRETTYBOX programs). These software programs match identical or similar sequences by assigning degrees of identity to various substitutions, deletions, or other modifications. Conservative substitutions typically include substitutions within the following groups: glycine, alanine, valine, isoleucine, and leucine; aspartic acid, glutamic acid, asparagine, and glutamine; serine and threonine; lysine and arginine; and phenylalanine and tyrosine.

[0395] By “isolated nucleic acid molecule” is meant a nucleic acid molecule, e.g., a DNA molecule, that is free of the nucleic acid sequence(s) which, in the naturally-occurring genome of the organism from which the nucleic acid molecule of the invention is derived, flank the nucleic acid molecule. The term therefore includes, for example, a recombinant DNA that is incorporated into a vector; into an autonomously replicating plasmid or virus; or into the genomic DNA of a prokaryote or eukaryote; or that exists as a separate molecule (for example, a cDNA or a genomic or cDNA fragment produced by PCR or restriction endonuclease digestion) independent of other sequences. The term “isolated nucleic acid molecule” also includes a recombinant DNA which is part of a hybrid gene encoding additional polypeptide sequence.

[0396] By a “kidney disorder,” as used herein is meant abnormal development, structure, or function of a kidney. Such a disorder may be congentital or it may be acquired during the life of an organism, e.g., a human. For example, a “kidney disorder” may result in the formation of fluid-filled sacs, or cysts, in a kidney. Moreover, either one or both of the kidneys may be affected by the disorder. Examples of “kidney disorders” include polycystic kidney disease, multicystic kidney disease, malformation of the kidney, Bardet-Biedl syndrome, kidney failure, acute renal failure, nephrolithiasis, congenital nephritic syndrome, kidney infection, and kidney stones.

[0397] By a “part” or “fragment,” in reference to a nucleic acid sequence is meant a stretch of 10 or more contiguous nucleic acids. In desirable embodiments, a part refers to a stretch of 20, 25, 30, 40, 50, 75, or 100 contiguous nucleic acids. In other desirable embodiments, a part is a stretch of 200, 300, 500, or 1000 contiguous nucleic acids and may include the entire coding region of a gene.

[0398] By a “part” of “fragment” in reference to an amino acid sequence is meant a stretch of 4 or more contiguous amino acids. In desirable embodiments, a part refers to a stretch of 10, 15, 25, 50, 75, or 100 contiguous amino acids. In other desirable embodiments, a part is a stretch of 200, 300, 500, or 1000 contiguous amino acids.

[0399] By “probe” or “primer” is meant a single-stranded nucleic acid sequence, for example, a DNA or RNA molecule, of defined sequence that can base pair to a second nucleic acid sequence that contains a complementary sequence (“target”). The stability of the resulting hybrid depends upon the extent of the base pairing that occurs. This stability is affected by parameters such as the degree of complementarity between the probe and target molecule, and the degree of stringency of the hybridization conditions. The degree of hybridization stringency is affected by parameters such as the temperature, salt concentration, and concentration of organic molecules, such as formamide, and is determined by methods that are well known to those skilled in the art. Probes or primers specific for a 904, POU2, 40S Ribosomal Protein S18, U2AF, 954, Nrp-1, Cad-1, V-ATPase Alpha Subunit, V-ATPase SFD subunit, V-ATPase 16 kDa proteolytic subunit, 1463, VPSP18, pescadillo, HN1F-β/vHNF1, 60S Ribosomal Protein L35, 60S Ribosomal Protein L44, CopZ1, 215, 307, 572, 1116A, 1548, Casein Kinase 1 a, Nodal-Related or Squint, Smoothened, 429, 428, Spinster, Glypican-6 or Knypek, Ribonucleotide Reductase Protein R1 Class 1, Kinesin-Related Motor Protein EG5, 459, Wnt5 or Pipetail, Aryl Hydrocarbon Receptor Nuclear Translocator 2A, Vesicular Integral-Membrane Protein VIP 36, 299, 994, 1373, Denticleless, Ribonucleotide Reductase Protein R2, TCP-1 Alpha, Telomeric Repeat Factor 2, SIL, U1 Small Nuclear Ribonucleoprotein C, Ski Interacting Protein, 297, TCP-1 Complex Gamma Chain, Small Nuclear Ribonucleoprotein D1, DNA Polymerase Epsilon Subunit B, 821-02, 1045, 1055-01, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, DEAD-Box RNA Helicase (DEAD5 or DEAD19), 1581, Cyclin A2, ISWI/SNF2, Chromosomal Assembly Protein C (XCAP-C), DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Valyl-tRNA Synthase, 40S Ribosomal Protein S5, TCP-1 Beta, TCP-1 Eta, Translation Elongation Factor eEF1 Alpha, 1257, 60S Ribosomal Protein L24, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, Ornithine Decarboxylase, Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS), Mitochondrial Inner Membrane Translocating Protein (rTIM23), 1447, ARS2, Sec61 Alpha, BAF53a, Histone Deacetylase, Fibroblast Isoform of the ADP/ATP Carrier Protein, or a TAFII-55 nucleic acid sequences, desirably have greater than 45%, 55%, 65%, 75%, or 85% identity, in more desirable embodiments such probes have sequence identity that is at least 85-99%, and may even be identical to a fragment of or the whole length of the sequence provided in any one of SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, or 157.

[0400] Probes can be detectably-labeled, either radioactively or non-radioactively, by methods that are well-known to those skilled in the art (see, for example, Ausubel et al. (Current Protocols in Molecular Biology, John Wiley & Sons, New York, 2000)). Probes can be used for methods involving nucleic acid hybridization, such as nucleic acid sequencing, nucleic acid amplification by the polymerase chain reaction, single stranded conformational polymorphism (SSCP) analysis, restriction fragment polymorphism (RFLP) analysis, Southern hybridization, Northern hybridization, in situ hybridization, electrophoretic mobility shift assay (EMSA), and other methods that are well known to those skilled in the art.

[0401] A molecule, e.g., an oligonucleotide probe or primer, a gene or fragment thereof, a cDNA molecule, a polypeptide, or an antibody, can be said to be “detectably-labeled” if it is marked in such a way that its presence can be directly identified in a sample. Methods for detectably-labeling molecules are well known in the art and include, without limitation, radioactive labeling (e.g., with an isotope, such as ³²P or ³⁵S) and non-radioactive labeling (e.g., with a fluorescent label, such as fluorescein).

[0402] By “polypeptide” or “polypeptide fragment” is meant a chain of two or more amino acids, regardless of any post-translational modification (e.g., glycosylation or phosphorylation), constituting all or part of a naturally or non-naturally occurring polypeptide. By “post-translational modification” is meant any change to a polypeptide or polypeptide fragment during or after synthesis. Post-translational modifications can be produced naturally (such as during synthesis within a cell) or generated artificially (such as by recombinant or chemical means). A “protein” may be made up of one or more polypeptides.

[0403] By “sample” is meant a tissue biopsy, amniotic fluid, cell, blood, serum, urine, stool, or other specimen from a patient or a test subject. The sample can be analyzed to detect a mutation in, or a change in expression level of, a 904, POU2, 40S Ribosomal Protein S18, U2AF, 954, Nrp-1, Cad-1, V-ATPase Alpha Subunit, V-ATPase SFD Subunit, V-ATPase 16 kDa Proteolytic Subunit, 1463, VPSP18, Pescadillo, HNF1-β/vHNF1, 60S Ribosomal Protein L35, 60S Ribosomal Protein L44, CopZ1, 215, 307, 572, 1116A, 1548, Casein Kinase 1 α, Nodal-Related or Squint, Smoothened, 429, 428, Spinster, Glypicari-6 or Knypek, Ribonucleotide Reductase Protein R1 Class 1, Kinesin-Related Motor Protein EG5, 459, Wnt5 or Pipetail, Aryl Hydrocarbon Receptor Nuclear Translocator 2A, Vesicular Integral-Membrane Protein VIP 36, 299, 994, 1373, Denticleless, Ribonucleotide Reductase Protein R2, TCP-1 Alpha, Telomeric Repeat Factor 2, SIL, U1 Small Nuclear Ribonucleoprotein C, Ski Interacting Protein, 297, TCP-1 Complex Gamma Chain, Small Nuclear Ribonucleoprotein D1, DNA Polymerase Epsilon Subunit B, 821-02, 1045, 1055-01, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, DEAD-Box RNA Helicase (DEAD5 or DEAD19), 1581, Cyclin A2, ISWI/SNF2, Chromosomal Assembly Protein C (XCAP-C), DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Valyl-tRNA Synthase, 40S Ribosomal Protein S5, TCP-1 Beta, TCP-1 Eta, Translation Elongation Factor eEF1 Alpha, 1257, 60S Ribosomal Protein L24, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, Ornithine Decarboxylase, Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS), Mitochondrial Inner Membrane Translocating Protein (rTIM23), 1447, ARS2, Sec61 Alpha, BAF53a, Histone Deacetylase, Fibroblast Isoform of the ADP/ATP Carrier Protein, or a TAFII-55 gene. For example, methods such as sequencing, single-strand conformational polymorphism (SSCP) analysis, or restriction fragment length polymorphism (RFLP) analysis of PCR products derived from a patient sample can be used to detect a mutation in an above-listed gene; ELISA can be used to measure levels of a polypeptide encoded by an above-listed gene; and PCR can be used to measure the level of an above-listed gene or nucleic acid sequence.

[0404] An antibody is said to “specifically bind” to a polypeptide if it recognizes and binds to the polypeptide (e.g., a 904, POU2, 40S Ribosomal Protein S18, U2AF, 954, Nrp-1, Cad-1, V-ATPase Alpha Subunit, V-ATPase SFD Subunit, V-ATPase 16 kDa Proteolytic Subunit, 1463, VPSP18, Pescadillo, HNF1-β/vHNF1, 60S Ribosomal Protein L35, 60S Ribosomal Protein L44, CopZ1, 215, 307, 572, 1116A, 1548, Casein Kinase 1 α, Nodal-Related or Squint, Smoothened, 429, 428, Spinster, Glypican-6 Knypek, Ribonucleotide Reductase Protein R1 Class 1, Kinesin-Related Motor Protein EG5, 459, WNT5 or Pipetail, Aryl Hydrocarbon Receptor Nuclear Translocator 2A, Vesicular Integral-Membrane Protein VIP 36, 299, 994, 1373, Denticleless, Ribonucleotide Reductase Protein R2, TCP-1 Alpha, Telomeric Repeat Factor 2, SIL, U1 Small Nuclear Ribonucleoprotein C, Ski Interacting Protein, 297, TCP-1 Complex Gamma Chain, Small Nuclear Ribonucleoprotein D1, DNA Polymerase Epsilon Subunit B, 821-02, 1045, 1055-01, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, DEAD-Box RNA Helicase (DEAD5 or DEAD19), 1581, Cyclin A2, ISWI/SNF2, Chromosomal Assembly Protein C (XCAP-C), DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Valyl-tRNA Synthase, 40S Ribosomal Protein S5, TCP-1 Beta, TCP-1 Eta, Translation Elongation Factor eEF1 Alpha, 1257, 60S Ribosomal Protein L24, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, Ornithine Decarboxylase, Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS), Mitochondrial Inner Membrane Translocating Protein (rTIM23), 1447, ARS2, Sec61 Alpha, BAF53a, Histone Deacetylase, Fibroblast Isoform of the ADP/ATP Carrier Protein, or a TAFII-55 polypeptide), but does not substantially recognize and bind to other, molecules in a sample, e.g., a biological sample that naturally includes the polypeptide.

[0405] A nucleic acid sequence or polypeptide is said to be “substantially identical” to a reference molecule if it exhibits, over its entire length, at least 51%, desirably at least 55%, 60%, or 65%, and in more desirable embodiments 75%, 85%, 90%, 95%, 98%, or 99% identity to the sequence of the reference molecule. For polypeptides, the length of comparison sequences is at least 16 amino acids, desirably at least 20, 30, 40, 50, 75, or 100 amino acids, and in more desirable embodiments at least 125, 150, 175, 200, 250, 300, 400, 500, 600, 700, 800, or 1000 amino acids. For nucleic acid sequences, the length of comparison sequences is at least 50 nucleotides, desirably at least 60, 90, 120, 150, 225, or 300 nucleotides, and in more desirable embodiments at least 375, 450, 525, 600, 750, 900, 1200, 1500, 2100, 2400, 2700, or 3000 nucleotides.

[0406] By a “substantially pure polypeptide” or “isolated polypeptide” is meant a polypeptide (or a fragment thereof) that has been separated from proteins and organic molecules that naturally accompany it. Typically, a polypeptide is substantially pure when it is at least 60%, by weight, free from the proteins and naturally-occurring organic molecules with which it is naturally associated. Preferably, the polypeptide is a 904, POU2, 40S Ribosomal Protein S18, U2AF, 954, Nrp-1, Cad-1, V-ATPase Alpha Subunit, V-ATPase SFD Subunit, V-ATPase 16 kDa Proteolytic Subunit, 1463, VPSP18, Pescadillo, HNF1-β/vHNF1, 60S Ribosomal Protein L35, 60S Ribosomal Protein L44, CopZ1, 215, 307, 572, 1116A, 1548, Casein Kinase 1 a, Nodal-Related or Squint, Smoothened, 429, 428, Spinster, Glypican-6 or Knypek, Ribonucleotide Reductase Protein R1 Class 1, Kinesin-Related Motor Protein EG5, 459, Wnt5 or Pipetail, Aryl Hydrocarbon Receptor Nuclear Translocator 2A, Vesicular Integral-Membrane Protein VIP 36, 299, 994, 1373, Denticleless, Ribonucleotide Reductase Protein R2, TCP-1 Alpha, Telomeric Repeat Factor 2, SIL, U1 Small Nuclear Ribonucleoprotein C, Ski Interacting Protein, 297, TCP-1 Complex Gamma Chain, Small Nuclear Ribonucleoprotein D1, DNA Polymerase Epsilon Subunit B, 821-02, 1045, 1055-01, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, DEAD-Box RNA Helicase (DEAD5 or DEAD19), 1581, Cyclin A2, ISWI/SNF2, Chromosomal Assembly Protein C (XCAP-C), DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Valyl-tRNA Synthase, 40S Ribosomal Protein S5, TCP-1 Beta, TCP-1 Eta, Translation Elongation Factor eEF1 Alpha, 1257, 60S Ribosomal Protein L24, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, Ornithine Decarboxylase, Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS), Mitochondrial Inner Membrane Translocating Protein (rTIM23), 1447, ARS2, Sec61 Alpha, BAF53a, Histone Deacetylase, Fibroblast Isoform of the ADP/ATP Carrier Protein, or a TAFII-55 polypeptide that is at least 75%, in a more desirable embodiment at least 90%, and in another desirable embodiment at least 99%, by weight, pure. A substantially pure or isolated 904, POU2, 40S Ribosomal Protein S18, U2AF, 954, Nrp-1, Cad-1, V-ATPase Alpha Subunit, V-ATPase SFD subunit, V-ATPase 16 kDa Proteolytic Subunit, 1463, VPSP18, Pescadillo, HNF1-β/vHNF1, 60S Ribosomal Protein L35, 60S Ribosomal Protein L44, CopZ1, 215, 307, 572, 1116A, 1548, Casein Kinase 1 α, Nodal-Related or Squint, Smoothened, 429, 428, Spinster, Glypican-6 or Knypek Ribonucleotide Reductase Protein R1 Class 1, Kinesin-Related Motor Protein EG5, 459, Wnt5 or Pipetail, Aryl Hydrocarbon Receptor Nuclear Translocator 2A, Vesicular Integral-Membrane Protein VIP 36, 299, 994, 1373, Denticleless, Ribonucleotide Reductase Protein R2, TCP-1 Alpha, Telomeric Repeat Factor 2, SIL, U1 Small Nuclear Ribonucleoprotein C, Ski Interacting Protein, 297, TCP-1 Complex Gamma Chain, Small Nuclear Ribonucleoprotein D1, DNA Polymerase Epsilon Subunit B, 821-02, 1045, 1055-01, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, DEAD-Box RNA Helicase (DEAD5 or DEAD19), 1581, Cyclin A2, ISWI/SNF2, Chromosomal Assembly Protein C (XCAP-C), DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Valyl-tRNA Synthase, 40S Ribosomal Protein S5, TCP-1 Beta, TCP-1 Eta, Translation Elongation Factor eEF1 Alpha, 1257, 60S Ribosomal Protein L24, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, Ornithine Decarboxylase, Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS), Mitochondrial Inner Membrane Translocating Protein (rTIM23), 1447, ARS2, Sec61 Alpha, BAF53a, Histone Deacetylase, Fibroblast Isoform of the ADP/ATP Carrier Protein, or TAFII-55 polypeptide can be obtained, for example, by extraction from a natural source (e.g., zebrafish or mammalian tissue), by expression of a recombinant nucleic acid sequence encoding an above-listed polypeptide, or by chemical synthesis. Purity can be measured by any appropriate method, e.g., by column chromatography, polyacrylamide gel electrophoresis, or HPLC analysis.

[0407] A polypeptide is substantially free of naturally associated components when it is separated from those proteins and organic molecules that accompany it in its natural state. Thus, a protein that is chemically synthesized or produced in a cellular system that is different from the cell in which it is naturally produced is substantially free from its naturally associated components. Accordingly, substantially pure polypeptides not only include those that are derived from eukaryotic organisms, but also those synthesized in E. coli or other prokaryotes.

[0408] By “transgene” is meant any piece of DNA which is inserted by artifice into a cell and typically becomes part of the genome of the organism which develops from that cell. Such a transgene may include a gene which is partly or entirely heterologous (i.e., foreign) to the transgenic organism, or may represent a gene homologous to an endogenous gene of the organism.

[0409] By “transgenic” is meant any cell which includes a DNA sequence which is inserted by artifice into a cell and becomes part of the genome of the organism which develops from that cell. As used herein, the transgenic organisms are generally transgenic vertebrates, such as, zebrafish, mice, and rats, and the DNA (transgene) is inserted by artifice into the nuclear genome.

BRIEF DESCRIPTION OF THE DRAWINGS

[0410]FIG. 1 is a schematic diagram of the protocol for the large-scale mutagenesis screen.

[0411]FIG. 2 shows Southern Blots of eight fish from two different F1 families (FIGS. 2A and 2B).

[0412]FIG. 3 is a schematic diagram of the structure of the provirus along with the position of the Southern blot probes and PCR primers.

[0413]FIG. 4A is a scanned image of four-day old zebrafish embryos, lateral view. A wild-type embryo is shown at the top, and an embryo containing a mutation in the 904 nucleic acid sequence is shown at the bottom.

[0414]FIG. 4B is a scanned image of three-day old zebrafish embryos, ventral view. A wild-type embryo is shown at the top, and an embryo containing a mutation in the 904 nucleic acid sequence is shown at the bottom.

[0415]FIG. 5 is a scanned image of three-day old zebrafish embryos. A wild-type embryo is shown at the top and an embryo containing a mutation in the POU2 nucleic acid sequence is shown at the bottom of the panel.

[0416]FIG. 6A is a scanned image of two-day old zebrafish embryos. A wild type embryo is at the top and an embryo containing a mutation in the 40S Ribosomal Protein S18 nucleic acid sequence is at the bottom.

[0417]FIG. 6B is a scanned image of four two-day old zebrafish embryos. The embryo on the right is wild-type and the other three embros contain a mutation in the 40S ribosomal protein S18 nucleic acid sequence.

[0418]FIG. 7A is a scanned image of four two-day old zebrafish embryos, lateral views. A wild-type embryo is shown at the top. Embryos containing a mutation in the U2AF nucleic acid sequence are shown below.

[0419]FIG. 7B is a scanned image of a two-day old zebrafish embryo, lateral view. This embryo contains a mutation in the U2AF nucleic acid sequence.

[0420]FIG. 8A is a scanned image of four-day old zebrafish embryos, ventral view. A wild-type embryo is shown at the top. The embryo at the bottom of the panel contains a mutation in the 954 nucleic acid sequence.

[0421]FIG. 8B is a scanned image of four-day old zebrafish embryos, lateral view. The embryo at the bottom of the panel contains a mutation in the 954 nucleic acid sequence.

[0422]FIG. 9A is a scanned image of five-day old zebrafish embryos, lateral view. The embryo at the bottom of the panel contains a mutation in the Nrp-1 gene.

[0423]FIG. 9B is a scanned image of five day old zebrafish embryos stained with Alcian blue, lateral view. The embryo at the bottom of the panel contains a mutation in the Nrp-1 nucleic acid sequence.

[0424]FIG. 10 is a scanned image of two-day old zebrafish embryos, lateral view. The embryo at the bottom of the panel contains a mutation in the Cad-1 gene. Cad-1 is a caudal homeobox zinc finger homolog.

[0425]FIG. 11A is a scanned image of a five-day old zebrafish embryo that contains a mutation in the V-ATPase Alpha Subunit nucleic acid sequence, dorsal view.

[0426]FIG. 11B is a scanned image of a five-day old wild-type zebrafish embryo, dorsal view.

[0427]FIG. 12A is a scanned image of three four-day old zebrafish embryos, lateral view. The embryo at the top of the panel is wild-type. The other embryos contain mutations in the V-ATPase SFD Subunit nucleic acid sequence.

[0428]FIG. 12B is a scanned image of a four-day old zebrafish embryo, lateral view. This embryo contains a mutation in the V-ATPase SFD Subunit nucleic acid sequence and shows brain necrosis.

[0429]FIG. 13A is a scanned image of two-day old zebrafish embryos, lateral view. The embryo at the top of the panel is wild-type. The embryo at the bottom of the panel contains a mutation in the V-ATPase 16 kDa Proteolytic Subunit nucleic acid sequence.

[0430]FIG. 13B is a scanned image of three-day old zebrafish embryos, lateral views. The embryo at the top of the panel is wild-type. The embryo at the bottom of the panel contains a mutation in the V-ATPase16 kDa Proteolytic Subunit nucleic acid sequence.

[0431]FIG. 14A is a scanned image of two-day old zebrafish embryos, lateral views. The embryo at the top of the panel is wild-type. The embryo at the bottom of the panel contains a mutation in the 1463 nucleic acid sequence which is a CD36-like Transmembrane Receptor.

[0432]FIG. 14B is a scanned image of six one-day old zebrafish embryos. The embryo in the upper-left hand corner is wild-type. The other embryos contain mutations in the 1463 nucleic acid sequence which is a CD36-Like Transmembrane Receptor.

[0433]FIG. 15A is a scanned image of three five-day old embryos, lateral views. The embryo at the top of the panel is wild-type. The other embryos contain mutations in the VPSP18 nucleic acid sequence.

[0434]FIG. 15B is a scanned image of five-day old embryos, lateral views. The embryo at the top of the panel is wild-type. The other embryos contain mutations in the VPSP18 nucleic acid sequence.

[0435]FIG. 16A is a scanned image of a four-day old zebrafish embryo. This embryo contains a mutation in the HNF1-β/vHNF1 gene. The mesonephros have ballooned out, and there is a bulge in a duct at chevron eight.

[0436]FIG. 16B is a scanned image of a four-day old zebrafish embryo. This embryo contains a mutation in the HNF1-β/vHNF1 gene.

[0437]FIG. 17A is a scanned image of a two-day old zebrafish embryo. This embryo contains a mutation in the 60S Ribosomal Protein L35 gene and has abnormal somites.

[0438]FIG. 17B is a scanned image of two day old zebrafish embryos. The embryo at the top of the panel is wild type. The embryo at the bottom of the panel contains a mutation in the 60S Ribosomal Protein L35 nucleic acid sequence.

[0439]FIG. 18A is a scanned image of a two-day old wild-type zebrafish embryo, lateral view.

[0440]FIG. 18B is a scanned image of a two-day old zebrafish embryo, lateral view. This embryo contains a mutation in the 60S Ribosomal Protein L44 nucleic acid sequence.

[0441]FIG. 19A is a scanned image of a three-day old wild-type zebrafish embryo, lateral view.

[0442]FIG. 19B is a scanned image of a three-day old zebrafish embryo, lateral view. This embryo contains a mutation in the CopZ1 nucleic acid sequence.

[0443]FIG. 20 is a scanned image of three-day old zebrafish embryos. The embryo at the top is wild-type. The other embryos contain mutations in the 215 nucleic acid sequence that encodes an ATP-dependent RNA helicase.

[0444]FIG. 21A is a scanned image of six-day old zebrafish embryos stained with Alcian blue, ventral views. The embryo at the top of the panel is wild-type, the embryo at the bottom of the panel contains a mutation in the 307 nucleic acid sequence which encodes Beta-1,3-Glucuronyltransferase.

[0445]FIG. 21B is a scanned image of six-day old zebrafish embryos stained with Alcian blue, lateral views. The embryo at the top is wild-type. The embryo at the bottom contain a mutation in the 307 nucleic acid sequence which encodes a Beta-1,3-Glucuronyltransferase.

[0446]FIG. 22A is a scanned image of four-day old zebrafish embryos, lateral views. The embryo at the top of the panel is wild-type. The embryo at the bottom of the panel contains a mutation in the 572 nucleic acid sequence.

[0447]FIG. 22B is a scanned image of a four-day old zebrafish embryo, ventral view. The embryo at the top of the panel is wild-type. The embryo at the bottom of the panel contains a mutation in the 572 nucleic acid sequence.

[0448]FIG. 23A is a scanned image of a six-day old wild-type zebrafish embryo, lateral view.

[0449]FIG. 23B is a scanned image of a six-day old zebrafish embryo, lateral view. The embryo contains a mutation in the 1116A nucleic acid sequence.

[0450]FIG. 24A is a scanned image of five-day old zebrafish embryos, lateral views. The embryo at the top of the panel is wild-type. The two embryos below it contain mutations in the 1548 nucleic acid sequence.

[0451]FIG. 24B is a scanned image of five-day old zebrafish embryos, ventral views. The embryo at the top of the panel is wild-type. The two embryos below it contain mutations in the 1548 nucleic acid sequence.

[0452]FIG. 25A is a scanned image of four-day old zebrafish embryos stained with Alcian blue, ventral views. The embryo at the top of the panel is wild-type. The embryo at the bottom of the panel contains a mutation in Casein Kinasel a nucleic acid sequence that has been identified as a Casein Kinase 1 a isoform.

[0453]FIG. 25B is a scanned image of three four-day old zebrafish embryos, ventral view. The embryos below contain mutations in the Casein Kinase 1 a nucleic acid sequence.

[0454]FIG. 26 is a scanned image of a five-day old zebrafish embryo, ventral view. The embryo contains a mutation in the Nodal-related (Squint) nucleic acid sequence.

[0455]FIG. 27A is a scanned image of three-day old zebrafish embryos, ventral views. The embryo at the top of the panel is wild-type. The embryo at the bottom of the panel contains a mutation in the Smoothened nucleic acid sequence.

[0456]FIG. 27B is a scanned image of two-day old zebrafish embryos, lateral views. The embryo at the top of the panel is wild-type. The embryo at the bottom of the panel contains a mutation in the Smoothened nucleic acid sequence.

[0457]FIG. 28A is a scanned image of six-day old zebrafish embryos, lateral views. The embryo at the upper-lefthand corner of the panel is wild-type. The other embryos in the panel contain mutations in the 429 nucleic acid sequence.

[0458]FIG. 28B is a scanned image of a six-day old zebrafish embryo, lateral view. This embryo contains a mutation in the 429 nucleic acid sequence, and displays a defect in the lower jaw.

[0459]FIG. 29A is a scanned image of a four-day old wild-type zebrafish embryo, shown in indirect light.

[0460]FIG. 29B is a scanned image of a four-day old 428 mutant embryo under indirect light.

[0461]FIG. 30 is a scanned image of one-day old zebrafish embryos, lateral posterior views. The embryo at the top of the panel is wild-type. The embryo at the bottom of the panel contains a mutation in the Glypican-6 or Knypek nucleic acid sequence.

[0462]FIG. 31A is a scanned image of six-day old zebrafish embryos, ventral views. The embryo at the top of the panel is wild-type. The embryo at the bottom of the panel contains a mutation in the nucleic acid sequence that encodes Ribonucleotide Reductase Protein R1 Class 1.

[0463]FIG. 31B is a scanned image of one-day old zebrafish embryos, lateral views. The embryo at the top of the panel is wild-type. The embryo at the bottom of the panel contains a mutation in the nucleic acid sequence that encodes Ribonucleotide Reductase Protein R1 Class 1.

[0464]FIG. 32 is a scanned image of three-day old zebrafish embryos, dorsal views. The embryo at the top of the panel is wild-type. The embryo at the bottom of the panel contains a mutation in the Kinesin-Related Motor Protein EGS nucleic acid sequence.

[0465]FIG. 33A is a scanned image of one-day old zebrafish embryos, lateral views. The embryo at the top of the panel is wild-type. The embryo at the bottom of the panel contains a mutation in the 459 nucleic acid sequence.

[0466]FIG. 33B is a scanned image of two-day old zebrafish embryos, lateral views. The embryo at the top of the panel is wild-type. The embryos at the bottom of the panel contain a mutation in the 459 nucleic acid sequence.

[0467]FIG. 34A is a scanned image of three-day old zebrafish embryos, lateral views. The embryo at the top of the panel is wild-type. The embryos at the bottom of the panel contain mutations in the Wnt5 (Pipetail) nucleic acid sequence.

[0468]FIG. 34B is a scanned image of three-day old zebrafish embryos, dorsal views. The embryo at the top of the panel is wild-type. The embryo at the bottom of the panel contains a mutation in the Wnt5 (Pipetail) nucleic acid sequence.

[0469]FIG. 35 is a scanned image of five-day old zebrafish embryos. A wild-type embryo is shown at the top and an embryo containing a mutation in the Aryl Hydrocarbon Receptor Nuclear Translocator 2A nucleic acid sequence is shown at the bottom of the panel.

[0470]FIG. 36 is a scanned image of five-day old zebrafish embryos. A wild-type embryo is shown at the top and two embryos containing a mutation in the Vesicular Integral Membrane Protein (VIP 36) nucleic acid sequence are shown at the bottom of the panel.

[0471]FIG. 37A is a scanned image of four-day old zebrafish embryos, ventral views. The embryo at the left of the panel is wild-type. The embryo at the right of the panel contains a mutation in the 299 nucleic acid sequence.

[0472]FIG. 37B is a scanned image of four-day old zebrafish embryos stained with Alcian blue, ventral views. The embryo at the left of the panel is wild-type. The embryo at the right of the panel contains a mutation in the 299 nucleic acid sequence. The mutant displays defects that include no jaw and branchial arches, as well as small fins.

[0473]FIG. 38A is a scanned image of a four-day old wild-type zebrafish embryo, lateral view.

[0474]FIG. 38B is a scanned image of a four-day old zebrafish embryo, lateral view, containing a mutation in the 994 nucleic acid sequence.

[0475]FIG. 39A is a scanned image of a two-day old wild-type zebrafish embryo, lateral view.

[0476]FIG. 39B is a scanned image of a two-day old zebrafish embryo, lateral view, containing a mutation in the 1373 nucleic acid sequence

[0477]FIG. 40A is a scanned image of two-day old zebrafish embryos, lateral views. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the Denticleless nucleic acid sequence is shown at the bottom of the panel.

[0478]FIG. 40B is a scanned image of a two-day old zebrafish embryo, lateral view, containing a mutation in the Denticleless nucleic acid sequence.

[0479]FIG. 41A is a scanned image of two-day old zebrafish embryos, lateral view. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the Ribonucleoside Reductase Protein R2 nucleic acid sequence is shown at the bottom of the panel.

[0480]FIG. 41B is a scanned image of a two-day old zebrafish embryo, lateral posterior view. The posterior of a wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the Ribonucleotide Reductase Protein R2 nucleic acid sequence is shown at the bottom of the panel.

[0481]FIG. 42A is a scanned image of a four-day old zebrafish embryo, lateral view. This embryo, containing a mutation in the TCP-1 Alpha nucleic acid sequence, displays a small head and heart edema.

[0482]FIG. 42B is a scanned image of a wild-type four-day old zebrafish embryo, lateral view.

[0483]FIG. 43 is a scanned image of two-day old zebrafish embryos, lateral views. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the Telomeric Repeat Factor 2 nucleic acid sequence is shown at the bottom of the panel.

[0484]FIG. 44 is a scanned image of three-day old zebrafish embryos. A wild-type embryo is shown at the top and an embryo containing a mutation in the SIL nucleic acid sequence is shown at the bottom of the panel.

[0485]FIG. 45A is a scanned image of two-day old zebrafish embryos, lateral view of the midbody. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the U1 Small Nuclear Ribonucleoprotein C nucleic acid sequence is shown at the bottom of the panel.

[0486]FIG. 45B is a scanned image of two-day old zebrafish embryo, lateral view. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the U1 Small Nuclear Ribonucleoprotein C nucleic acid sequence is shown at the bottom of the panel.

[0487]FIG. 46A is a scanned image of a one-day old zebrafish embryo, lateral view. A wild-type embryo is shown at the top of the panel, note that the mid/hind-brain barrier is obvious, and brain structures clearly visible.

[0488]FIG. 46B is a scanned image of a one-day old zebrafish embryo, lateral view, with a mutation in the Ski Interacting Protein (SKIP) nucleic acid sequence. This mutant embryo has a small head due to extensive brain necrosis.

[0489]FIG. 47A is a scanned image of three-day old zebrafish embryos, lateral views. An embryo containing a mutation in the 297 nucleic acid sequence is shown at the top of the panel, note the flattened head, brain containing yellow debris, and large yolk sac. A wild-type embryo is shown at the bottom of the panel.

[0490]FIG. 47B is a scanned image of three-day old zebrafish embryos, dorsal views. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the 297 nucleic acid sequence is shown at the bottom of the panel. The branchial arches appear abnormal.

[0491]FIG. 48 is a scanned image of three-day old zebrafish embryos, lateral views. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the TCP-1 Complex Gamma Chain nucleic acid sequence is shown at the bottom of the panel.

[0492]FIG. 49A is a scanned image of a two-day old zebrafish embryo, lateral view. A wild-type embryo is shown.

[0493]FIG. 49B is a scanned image of a two-day old zebrafish embryo, lateral view, containing a mutation in the nucleic acid sequence that encodes the Small Nuclear Ribonucleoprotein D1.

[0494]FIG. 50 is a scanned image of three-day old zebrafish embryos. A wild-type embryo is shown at the top and an embryo containing a mutation in the DNA Polymerase Epsilon Subunit B nucleic acid sequence is shown at the bottom of the panel.

[0495]FIG. 51 is a series of scanned images of zebrafish embryos containing a mutation in an 821-02 nucleic acid sequence, lateral views. FIG. 51A shows a one-day old embryo and FIG. 51B shows the posterior of two-day old embryos.

[0496]FIG. 52 is a scanned image of one-day old zebrafish embryos. A wild-type embryo is shown at the top-left corner of this panel, the other three embryos contain a mutation in the 1045 nucleic acid sequence.

[0497]FIG. 53A is a scanned image of three-day old zebrafish embryos, lateral views. A wild-type embryo is shown at the top of the panel, and embryos containing mutations in the 1055-1 nucleic acid sequence are shown below. The 1055-1 gene encodes a MAK16 homolog.

[0498]FIG. 53B is a scanned image of three-day old zebrafish embryos, lateral views. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the 1055-1 nucleic acid sequence is shown below. The 1055-1 gene encodes a MAK16 homolog.

[0499]FIG. 54A is a scanned image of a two-day old zebrafish embryo, lateral view. A wild-type embryo is shown.

[0500]FIG. 54B is a scanned image of a two-day old zebrafish embryo, lateral view, containing a mutation in the Spliceosome Associated Protein 49 nucleic acid.

[0501]FIG. 55 is a scanned image of three-day old zebrafish embryos, lateral views. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the DNA Replication Licensing Factor MCM7 nucleic acid sequence is shown at the bottom of the panel.

[0502]FIG. 56A is a scanned image of a one-day old wild-type zebrafish embryo, lateral view.

[0503]FIG. 56B is a scanned image of a one-day old zebrafish embryo, lateral view. This embryo contains a mutation in the Dead-Box RNA Helicase (DEAD5 or DEAD19) nucleic acid sequence.

[0504]FIG. 57A is a scanned image of one-day old zebrafish embryos, lateral views. A wild-type embryo is shown in the left-hand panel, and an embryo containing a mutation in the 1581 nucleic acid sequence is shown in the right-hand panel.

[0505]FIG. 57B is a scanned image of one-day old zebrafish embryos, ventral views. A wild-type embryo is shown in the left-hand panel, and an embryo containing a mutation in the 1581 nucleic acid sequence is shown in the right-hand panel.

[0506]FIG. 58 is a scanned image of four-day old zebrafish embryos, lateral views. A wild-type embryo is shown at the top of the panel, and embryos containing mutations in the Cyclin A2 nucleic acid sequence are shown below.

[0507]FIG. 59 is a scanned image of two-day old zebrafish embryos. A wild-type embryo is shown at the top and two embryos containing a mutation in the ISWI/SNF nucleic acid sequence are shown at the bottom of the panel.

[0508]FIG. 60 is a scanned image of four-day old zebrafish embryos, lateral views. A wild-type embryo is show at the top of the panel and an embryo containing a mutation in the XCAP-C nucleic acid sequence is shown at the bottom.

[0509]FIG. 61A is a scanned image of three-day old zebrafish embryos, dorsolateral views. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the DNA Replication Licensing Factor MCM2 nucleic acid sequence is shown at the bottom of the panel.

[0510]FIG. 61B is a scanned image of four-day old zebrafish embryos, lateral views. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the DNA Replication Licensing Factor MCM2 nucleic acid sequence is shown at the bottom of the panel.

[0511]FIG. 62A is a scanned image of a four-day old zebrafish embryo, lateral view. An embryo containing a mutation in the DNA Replication Licensing Factor MCM3 nucleic acid sequence is shown.

[0512]FIG. 62B is a scanned image of a four-day old wild-type zebrafish embryo, lateral view.

[0513]FIG. 63A is a scanned image of three-day old zebrafish embryos, ventral views. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the Valyl-tRNA Synthase nucleic acid sequence is shown at the bottom of the panel. Note the delay in jaw development displayed by the mutant embryo.

[0514]FIG. 63B is a scanned image of four-day old zebrafish embryos, ventral views, stained with Alcian blue. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the Valyl-tRNA Synthase nucleic acid sequence is shown at the bottom of the panel.

[0515]FIG. 64A is a scanned image of a two-day old wild-type zebrafish embryo, lateral view.

[0516]FIG. 64B is a scanned image of a two-day old zebrafish embryo, lateral view. An embryo containing a mutation in the 40S Ribosomal Protein S5 nucleic acid sequence is shown.

[0517]FIG. 65A is a scanned image of five-day old zebrafish embryos stained with Alcian blue, ventral views. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the TCP-1 Beta nucleic acid sequence is shown at the bottom of the panel.

[0518]FIG. 65B is a scanned image of four-day old zebrafish embryos, ventral views. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the TCP-1 Beta nucleic acid sequence is shown at the bottom of the panel.

[0519]FIG. 66A is a scanned image of three-day old zebrafish embryos, dorsal views. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the TCP-1 Eta nucleic acid sequence is shown at the bottom of the panel.

[0520]FIG. 66B is a scanned image of five-day old zebrafish embryos, lateral views. This image shows the muscles of a wild-type embryo at the top of the panel, and a mutant embryo containing a mutation in the TCP-1 Eta nucleic acid sequence is shown at the bottom of the panel.

[0521]FIG. 67 is a series of scanned images of three-day old zebrafish embryos. A lateral view of a wild-type embryo is shown in FIG. 67A, a lateral view of an embryo containing a mutation in the Translation Elongation Factor eEF1 Alpha nucleic acid sequence is shown in FIG. 67B, and ventral views of a wild-type (top) and Translation Elongation Factor eEF1 Alpha mutant (bottom) embryos are shown in FIG. 67C.

[0522]FIG. 68A is a scanned image of a five-day old zebrafish embryo, ventral view. An embryo containing a mutation in the 1257 nucleic acid sequence is shown.

[0523]FIG. 68B is a scanned image of a five-day old zebrafish embryo, ventral view.

[0524]FIG. 69A is a scanned image of three-day old zebrafish embryos, lateral views. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the 60S Ribosomal Protein L24 nucleic acid sequence is shown at the bottom of the panel.

[0525]FIG. 69B is a scanned image of three-day old zebrafish embryos, lateral views. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the 60S Ribosomal Protein L24 nucleic acid sequence is shown at the bottom of the panel.

[0526]FIG. 70A is a scanned image of three-day old zebrafish embryos, ventral views. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the Non-Muscle Adenylosuccinate Synthase nucleic acid sequence is shown at the bottom of the panel.

[0527]FIG. 70B is a scanned image of four-day old zebrafish embryos, ventral views. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the Non-Muscle Adenylosuccinate Synthase nucleic acid sequence is shown at the bottom of the panel.

[0528]FIG. 71A is a scanned image of four-day old zebrafish embryos, lateral views. A wild-type embryo is shown at the top of the panel, and embryos containing mutations in the Nuclear-Cap Binding Protein Subunit 2 nucleic acid sequence are shown below it.

[0529]FIG. 71B is a scanned image of four-day old zebrafish embryos, lateral views. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the Nuclear-Cap Binding Protein Subunit 2 nucleic acid sequence is shown at the bottom of the panel.

[0530]FIG. 72 is a scanned image of six-day old zebrafish embryos. A wild-type embryo is shown at the top and two embryos containing a mutation in the Ornithine Decarboxylase nucleic acid sequence are shown at the bottom of the panel.

[0531]FIG. 73A is a scanned image of a four-day old zebrafish embryo, dorsal view. A wild-type embryo is shown.

[0532]FIG. 73B is a scanned image of a four-day old zebrafish embryo, dorsal view. An embryo containing a mutation in the Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS) nucleic acid sequence is shown at the bottom of the panel.

[0533]FIG. 74A is a scanned image of three-day old zebrafish embryos, lateral views. A wild-type embryo is shown at the left of the panel, and an embryo containing a mutation in the Mitochondrial Inner Membrane Translocating Protein (rTIM23) nucleic acid sequence is shown at the right of the panel. The mutant embryo displays smaller lighter eyes than the wild-type embryo. The mutant also displays pooling of blood around the heart, and some mutants have slower tail circulation.

[0534]FIG. 74B is a scanned image of three-day old zebrafish embryos, dorsal views. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the Mitochondrial Inner Membrane Translocating Protein (rTIM23) nucleic acid sequence is shown at the bottom of the panel.

[0535]FIG. 75A is a scanned image of three-day old zebrafish embryos, lateral views. A wild-type embryo is shown at the top of the panel, and embryos containing mutations in the 1447 nucleic acid sequence are shown below.

[0536]FIG. 75B is a scanned image of four-day old zebrafish embryos, ventral views. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the 1447 nucleic acid sequence is shown at the bottom of the panel.

[0537]FIG. 76A is a scanned image of three-day old zebrafish embryos, dorsal views. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the ARS2 nucleic acid sequence is shown at the bottom of the panel.

[0538]FIG. 76B is a scanned image of a five-day old zebrafish embryo, lateral view. This embryo contains a mutation in the ARS2 nucleic acid sequence

[0539]FIG. 77A is a scanned image of three-day old zebrafish embryos, lateral views. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the Sec61 Alpha nucleic acid sequence is shown at the bottom of the panel.

[0540]FIG. 77B is a scanned image of four-day old zebrafish embryos, lateral views. A wild-type embryo is shown at the top of the panel, and embryos containing a mutation in the Sec61 Alpha nucleic acid sequence are shown below it.

[0541]FIG. 78 is a scanned image of two-day old zebrafish embryos. A wild-type embryo is shown at the top and three embryos containing a mutation in the BAF53a nucleic acid sequence are shown at the bottom of the panel.

[0542]FIG. 79A is a scanned image of three-day old zebrafish embryos, lateral views. A wild-type embryo is shown at the top of the panel, and an embryo containing a mutation in the Histone Deacetylase nucleic acid sequence is shown at the bottom of the panel.

[0543]FIG. 79B is a scanned image of three-day old zebrafish embryos, dorsal views. A wild-type embryo is shown at the right of the panel, and an embryo containing a mutation in the Histone Deacetylase nucleic acid sequence is shown at the left of the panel.

[0544]FIG. 80A is a scanned image of a four-day old zebrafish embryo, lateral view. A wild-type embryo is shown.

[0545]FIG. 80B is a scanned image of a four-day old zebrafish embryo, lateral view. An embryo containing a mutation in the Fibroblast Isoform of the ADP/ATP Carrier Protein nucleic acid sequence is shown.

[0546]FIG. 81 is a scanned image showing a 48-hour old wild-type zebrafish embryo (Top) and a 48-hour old zebrafish embryo containing a mutation in a 459 nucleic acid sequence (Bottom).

[0547]FIG. 82 is a scanned image of a four-day old zebrafish embryo containing a mutation in a 459 nucleic acid sequence. The arrow points to a kidney cyst, which is also circled with a dashed line.

[0548]FIG. 83A is a scanned image of a transverse section through a wild-type zebrafish embryo. The arrow points to the kidney tubule and the notochord is labeled “nc.”

[0549]FIG. 83B is a scanned image of a transverse section through a zebrafish embryo containing a mutation in a 459 nucleic acid sequence. The arrow points to the kidney tubule which is swollen into a large cyst. The notochord is labeled “nc.”

[0550]FIG. 84 is a scanned image of a dorsal view of a 45-hour old wild-type zebrafish embryo. This embryo was stained with a polyclonal antibody raised against the C-terminal region of a 459 polypeptide. The dotted line represents the lateral boundary of one of the kidney tubules. The staining is localized to the apical surface of the epithelial cells lining the tubule.

DETAILED DESCRIPTION

[0551] We devised a strategy for the most efficient breeding and screening of proviral insertions so that we could perform a large-scale screen. By breaking this multi-step experiment into component projects and designing a protocol for each, we established a successful method for identifying vertebrate genes that are essential for normal development. Here we describe the screening methods and report on the first 80 developmental mutants obtained in our screen and the rapid cloning of the mutated genes.

[0552] The intent of this screen is to recover all embryonic mutations visible in a dissecting microscope at 1, 2, and 5 days post fertilization. As was found in large chemical mutagenesis screens in the zebrafish, mutants identified in this way include those with highly specific developmental defects involving one or a few organ systems and mutants that display one of several more general, and frequently recurring “syndromes.” In our experience, some mutants fall between these two groups, having specific aspects in combination with more widespread abnormalities. We refer to these phenotypes as “mixed.” Like almost all embryonic mutations ever isolated in the zebrafish by any method of screening, our mutants are recessive lethals. Most homozygous mutant embryos die between 3-10 days of age.

[0553] The general screening method used to identify the 80 genes described in this application involved the following steps. First, we injected high titer retrovirus into zebrafish embryos at the 1000-2000 cell stage. Approximately 36,000 of these injected fish (founders) were raised and pair-mated to generate 10,000 families of F1 fish. To identify fish with the most non-overlapping proviral inserts, 30 fish from each F1 family were analyzed by real-time quantitative PCR analysis of DNA extracted from tail fin clips. The tail fin clips from the eight fish containing the greatest number of proviral inserts, as assessed by real time quantitative PCR analysis, were further characterized using Southern blot hybridization. Fish with at least 3 unique proviral inserts were selected and pair-mated to generated 10,000 F2 families. The F2 fish were raised and siblings from these F2 families were crossed to generate F3 families, which were visually screened for developmental defects.

[0554] Since the viral vectors used in the experiments described herein generate a 4 bp duplication when they integrate, the insertion sites described herein are the approximate locations within the given nucleic acid sequences and the exact location may vary by as many as 4 nucleotides in either direction from that provided in the following descriptions. Described herein are the first 80 of the developmental mutants identified in the screen thus far.

[0555] The 904 Gene

[0556] Insertional mutations in the 904 nucleic acid sequence result in a severe disorganization of the brain and central nervous system (CNS) including, an overgrowth of neuronal tissue, a lack of definition of the brain compartments, increased vascularization, and brain hemorrhages. These phenotypes are already apparent two days into development and became more pronounced during day three, four, and five. 904 mutant embryos are still alive by day five as evidenced by the heart continuing to beat and the embryo responding to touch. In addition, a tail kink becomes evident by day four of development. The tail kink is indicative of the neural tube being disorganized.

[0557] We mapped the insertion site of the F5 virus in the 904 nucleic acid sequence to be approximately at nucleotide 1315 of SEQ ID NO:1. Analysis of the 904 amino acid sequence showed that it contains a zinc-binding domain which spans amino acids 80-125 of SEQ ID NO:2, and which may be important for protein degradation and cell cycle regulation. The 904 amino acid sequence also contains ankyrin repeats.

[0558] In addition, the zebrafish 904 amino acid sequence is 83% identical and 87% similar to a Drosophila melanogaster protein of unknown function (the CG 5841 gene product; GenBank Accession No. AAF49551.1) over a region encompassing amino acids 8 to 236 of SEQ ID NO:2. The 904 amino acid sequence is also 92% identical and 96% similar to the protein encoded by Homo sapiens clone IMAGE:3350926 (GenBank Accession No. BE255862) the over a region spanning 151 amino acids of SEQ ID NO:2. Furthermore, the 904 nucleic acid sequence (SEQ ID NO:1) is 79% identical over a region spanning 469 nucleotides of the human IMAGE:3350926 nucleotide sequence.

[0559] The POU2 Gene

[0560] We isolated zebrafish mutants containing a virus insertion in the POU2 gene (GenBank Accession No. D28548). This gene was previously described by Takeda et al. (Genes & Dev. 8:45-49, 1994). POU domain proteins are a large family of transcriptional regulatory proteins that may play roles in the regulation of gene expression in early development. The zebrafish POU2 protein is 37% identical and 47% similar to the human Brain-1 protein (GenBank Accession No. NP_(—)006227.1) over a stretch of amino acids encompassing amino acids 81 to 426 of SEQ ID NO:4. The nucleotide sequence encoding the human Brain-1 protein is 76% identical over 196 nucleotides to the zebrafish POU2 gene (SEQ ID NO:3; GenBank Accession No. D28548).

[0561] The POU2 mutants that we isolated display a lack of a mid-brain/hind-brain boundary and only have one otolith. This phenotype is visually detectable on day one of development, but may be observed earlier by in situ hybridization. In addition, the phenotype becomes more pronounced over time, but POU2 mutant embryos are still alive on day five of development.

[0562] The hair cells of the otolith can be visualized by staining for actin bundles. Accordingly one skilled in the art can readily determine the number of hair cells in a developing zebrafish embryo. POU2 mutant embryos may therefore be used in screens for test compounds that affect the number of hair cells in these embryos. A test compound that, when contacted with a POU2 embryo, results in an increase in the number of hair cell, is a candidate neuroprotective compound.

[0563] We mapped the insertion site of the virus in the POU2 gene and found the F5 virus insertion of allele hi349 to be approximately at nucleotide 1088, and the GT virus insertion of allele hi1940 to be at nucleotide 653, of SEQ ID NO:3.

[0564] The 40S Ribosomal Protein S18 Gene

[0565] We isolated zebrafish mutants containing a viral insertion in the 40S Ribosomal Protein S18 gene (GenBank Accession No. AF210641). We determined the GT virus insertion to be between nucleotides 220 and 221 of SEQ ID NO:5. The zebrafish gene product is 97% identical and 98% similar to the Mus musculus or Homo sapiens 40S Ribosomal Protein 18S (GenBank Accession Nos. NP_(—)035426.1 and P25232, respectively) over 152 amino acids of SEQ ID NO:6. The zebrafish gene is 82% identical over 410 nucleotides of SEQ ID NO:5 to the human 40S Ribosomal Protein S18.

[0566] Zebrafish mutant for the 40S Ribosomal Protein S18 have a head and eyes that are at least 50% smaller than those of identically aged wild-type zebrafish, a kinked tail, a reduced forebrain (generally only 50% the size of wild-type), and a bigger hind-brain. These defects may be observed on day two of development, but the embryos continue to be alive at day five of development.

[0567] The U2AF Gene

[0568] We isolated zebrafish mutants containing a viral insertion in the U2 Small Nuclear Rna Auxiliary Factor (U2AF) gene and found the insertion to be approximately between nucleotides 46 and 47 of SEQ ID NO:7 (GT virus for allele hi2505A and F5 virus for allele hi199). The zebrafish gene product is 92% identical and 94% similar to the human U2AF protein (GenBank Accession No. NP_(—)006749.1) in the region encompassing amino acids 1-249 of SEQ ID NO:8. In addition, the zebrafish U2AF gene is 82% identical to the Mus musculus homologue (GenBank Accession No. AK012334) over a region encompassing 537 nucleotides of SEQ ID NO:7. This zebrafish gene is also 82% identical to the Homo sapiens homologue (GenBank Accession No. XM_(—)009787.3) over a region encompassing 491 nucleotides of SEQ ID NO:7.

[0569] The zebrafish U2AF mutants display general brain necrosis by day two of development, with a particularly strong effect in the tectum.

[0570] The 954 Gene

[0571] We isolated zebrafish mutants containing a GT viral insertion approximately at nucleotide 432 or 506 of the 954 gene (SEQ ID NO:9). The coding region of the 954 gene is contained in the region spanning nucleotides 583 to 1252 of SEQ ID NO:9. The zebrafish 954 gene product is 93% identical and 96% similar to the human FLJ23591 protein (Accession No. NP_(—)079352.1).

[0572] In these mutants, cartilage does not stain with Alcian blue, but cartilage cells are visible in tissue sections. The zebrafish 954 gene is similar to Arabidopsis and Synechosystis dTDP-glucose 4-6-dehydratase.

[0573] The Nrp-1 Gene

[0574] We isolated zebrafish mutants containing a viral insertion in the Neurogenin Related Protein-1 (Nrp-1) gene (GenBank Accession No. AF036149). We determined the GT viral insertion to map approximately to nucleotide 1149 of SEQ ID NO:11. The Nrp-1 coding regions spans nucleotides 114 to 735 of SEQ ID NO:11 and the amino acid sequence is provided in SEQ ID NO:12.

[0575] Zebrafish mutant for the Nrp-1 gene have motility problems and are touch insensitive around the head, but not around the tail. By day five of development, zebrafish mutant for the Nrp-1 gene have a gaping jaw. In mice, a knockout mutation of this gene results in defects in cell fate determination of the neural crest.

[0576] The Cad-1 Gene

[0577] We isolated zebrafish mutants containing a viral insertion in the Caudal (Cad-1) gene (GenBank Accession No. X66958.1), a homeobox-containing transcription factor. We determined the GT virus insertion to be approximately between nucleotides 583 and 584 of SEQ ID NO:13 in both the hi2188A and 2092 alleles. The zebrafish Cad-1 gene product is 52% identical and 62% similar to the Homo sapiens homeobox protein CDX4 (GenBank Accession No. NP_(—)005184.1) over 258 amino acids of SEQ ID NO:14. In addition, the zebrafish Cad-1 gene is 80% identical to the Homo sapiens caudal type homeobox transcription factor 2 (CDX2; GenBank Accession No. XM_(—)039996.1) over 186 nucleotides of SEQ ID NO:13 and 83% identical to Homo sapiens CDX4 (GenBank Accession No. XM_(—)010453.1) over 102 nucleotides of SEQ ID NO:13.

[0578] On day one of development, these mutants have a shortened trunk and tail and no yolk sac extension. By day two of development, zebrafish mutant for Cad-1 are no longer motile and they are touch insensitive.

[0579] The V-ATPase Alpha Subunit Gene

[0580] We isolated zebrafish mutants containing an insertion of an F5 virus approximately at nucleotide 169 of the zebrafish V-ATPase Alpha Subunit gene (V-ATPase Alpha Subunit; SEQ ID NO:15). The zebrafish V-ATPase Alpha Subunit gene is 87% identical to Mus musculus clone 4930500C14 (GenBank Accession No. AK015654) over 185 nucleotides of SEQ ID NO:15, and 85% identical to the Homo sapiens LOC90423 gene (GenBank Accession No. XM_(—)031576.1) over 223 nucleotides of SEQ ID NO:15. In addition, the zebrafish V-ATPase Alpha Subunit gene product is 77% identical and 89% similar to the human homologue (GenBank Accession No. AAH04443) over a region spanning 226 amino acids of SEQ ID NO:16.

[0581] Mutants in this gene have reduced pigmentation in both the body and the eye.

[0582] The V-ATPase SFD Subunit Gene

[0583] We isolated zebrafish mutants containing an insertion of an F5 virus between approximately nucleotides 31 and 32 of the zebrafish VATPase SFD Subunit gene (SEQ ID NO:17). The coding region of the zebrafish V-ATPase SFD Subunit gene spans nucleotides 57 to 1445 of SEQ ID NO:17. The zebrafish V-ATPase SFD Subunit gene product is 89% identical and 94% similar to a Sus scrofa V H⁺-ATPase gene product (GenBank Accession No. AJ223757), and is 88% identical and 93% similar to the Homo sapiens MSTP042 protein (GenBank Accession No. AF 13222) over a stretch of 464 amino acids of SEQ ID NO:18. In addition, the zebrafish V-ATPase SFD Subunit gene is 81% identical to 537 nucleotides of the human LOC51606 mRNA over a region spanning 537 nucleotides of SEQ ID NO:17.

[0584] Mutants in this gene have reduced pigmentation in both the body and the eye by day three of development.

[0585] The V-ATPase 16 kDa Proteolytic Subunit Gene

[0586] We isolated zebrafish mutants containing an insertion of a virus approximately between nucleotides 242 and 243 of the zebrafish V-ATPase 16 kDa Proteolytic Subunit gene (SEQ ID NO:19). The coding region of the zebrafish V-ATPase 16 kDa Proteolytic Subunit gene spans nucleotides 170-631 of SEQ ID NO:19. We determined that the zebrafish V-ATPase 16 kDa Proteolytic Subunit gene is 86% identical to the Ovis aries homologue (GenBank Accession No. AF027705) over a region spanning 179 nucleotides, and 81% identical to the human homologue (GenBank Accession No. NM_(—)001694) over a region spanning 425 nucleotides, of SEQ ID NO:19. In addition, the zebrafish V-ATPase 16 kDa Proteolytic Subunit gene product is 91% identical and 94% similar to the corresponding human homologue (GenBank Accession No: P27449) over a region spanning 148 amino acids, and 91% identical and 95% similar to the mouse homologue (GenBank Accession No. NP_(—)033859.1) over a region spanning 149 amino acids, of SEQ ID NO:20.

[0587] Mutants in this gene have reduced pigmentation in both the body and the eye by 48 hours of development, as well as reduced touch sensitivity by 72 hours of development.

[0588] The 1463 Gene

[0589] We isolated zebrafish mutants containing an insertion of a GT virus approximately between nucleotides 389 and 390 in the zebrafish 1463 gene (SEQ ID NO:157). The coding region of the zebrafish 1463 gene spans nucleotides 266 to 1858 of SEQ ID NO:157. The zebrafish 1463 gene is 74% identical to the nucleic acid sequence encoding the Homo sapiens LimpII protein (GenBank Accession No. D12676) over a region spanning 175 bp of SEQ ID NO:157. In addition, the zebrafish 1463 gene product is 44% identical and 68% similar to the human Limp2 protein (GenBank Accession No. A56525) in a region encompassing amino acids 6-474 of SEQ ID NO:158.

[0590] These mutants have a defect in body pigmentation, but the eye is unaffected. Furthermore, 1463 mutants display brain dysmorphia including a swollen tectum, and a shorter hind-brain by day two of development and the defect becomes stronger as development progresses. In addition, the hind-brain is at least 10% shorter than wild-type. Zebrafish 1463 mutants also have less body pigment.

[0591] LimpII is part of a family of proteins which includes a transmembrane receptor for thrombospondin 1 (tsp1). In addition, LimpII has also been shown to bind Tsp1. Furthermore, Tsp1 is thought to be a naturally-occurring inhibitor of angiogenesis that limits vessel density in normal tissue and curtails tumor growth and progression (Jimenez et al., Nature Medicine 6:41-80, 2000; Tuszynski and Nicosia, Bioessays 18:71-76, 1996). Accordingly, the zebrafish 1463 gene may function in the development of vasculature in the brain and may be an important target for stroke therapy.

[0592] The VPSP18 Gene

[0593] We isolated zebrafish mutants containing an insertion of a GT virus approximately at nucleotide 2336 of the VPSP18 gene (SEQ ID NO:21), which encodes a protein that is 37% identical and 59% similar, over the C-terminal 592 amino acids of SEQ ID NO:22, to the Drosophila melanogaster Vacuolar Sorting Protein Deep Orange (GenBank Accession No. AE003421). In addition, the VPSP18 gene is 87% identical to the human Vacuolar Protein Sorting Protein 18 gene (GenBank Accession Nos. AF308802 and XP_(—)031478.1) over a stretch of 58 nucleotides of SEQ ID NO:21 and is 67% identical and 84% similar to this human protein over a region encompassing 604 amino acids of SEQ ID NO:22.

[0594] Mutants in the VPSP18 gene show a decrease in pigmentation, a lack of iridophores, and some necrosis in the tectum.

[0595] The Pescadillo Gene

[0596] We isolated zebrafish mutants containing a viral insertion in the Pescadillo gene, for example, approximately at nucleotide 20 of GenBank Accession No. U77627. This gene is required for the normal size of some, but not all, embryonic organs (Allende et al., Genes Dev. 10:3141-3155, 1996). In addition, the protein encoded by this gene contains a BRCF motif. Furthermore, Pescadillo was recently identified as a gene whose expression is elevated in p53 deficient tumor cell lines and it is now thought to play a role in cell cycle check points (Charpentier et al., Cancer Res. 60:5977-5983, 2000; Kinoshita et al., J. Biol. Chem. 276:6656-6665, 2001).

[0597] The HNF1-β/vHNF1 Gene

[0598] We isolated zebrafish mutants containing a viral insertion in the HNF1-β/vHNF1 gene. The hi548 allele is the result of an F5 virus insertion approximately at nucleotide 1682/1683, the hi1843 allele is the result of a GT virus insertion at nucleotide 745, and the hi2169 allele is the result of a GT virus insertion at nucleotide 361, of SEQ ID NO:23. The coding region of the zebrafish HNF1-β/vHNF1 gene is contained within nucleotides 143 to 1819 of SEQ ID NO:23. This gene is 84% identical to the Rattus norvegicus Hepatic Transcription Factor 1 gene (GenBank Accession No. NM_(—)012669.1) over 227 nucleotides of SEQ ID NO:23. This zebrafish gene, SEQ ID NO:23, also is identical to various stretches of the Homo sapiens Hepatic Transcription Factor 1 nucleotide sequence (GenBank Accession No. XM_(—)012120.3), 77% identical over 381 nucleotides, 78% over 375 nucleotides nucleotides, 80% identical over 270 nucleotides, 81% identical over 149 nucleotides, and 74% identical over 95 nucleotides. In addition, the zebrafish HNF1-β/vHNF1 protein (SEQ ID NO:24) is 80% identical and 87% similar to the human homologue (GenBank Accession Nos. NP_(—)000449.1, XP_(—)008554.1, S34412, X58840, and U90287).

[0599] Mutants in the HNF1-β vHNF1 gene display a cystic kidney and an abnormal pancreas. Zebrafish containing the hi2169 allele also have defects in the patterning of the hind-brain, resulting in defective ear (otolith) structures. In addition, mutations in the human HNF1-β/vHNF1 gene has also been found to cause of a genetic form of human diabetes, MODY V (maturity onset diabetes of the young), in which patients have kidney defects in addition to diabetes (Iwasaki et al., Diabetes Care 21:2144-2148, 1998; Horikawa et al., Nat. Genet. 17:384-385, 1997).

[0600] The 60S Ribosomal Protein L35 Gene

[0601] We isolated zebrafish mutants containing a viral insertion approximately between nucleotides 30 and 31 of the 60S Ribosomal Protein L35 nucleic acid sequence (SEQ ID NO:25). The coding region of the zebrafish 60S Ribosomal Protein L35 spans nucleotides 29-397 of SEQ ID NO:25 and this gene is 82% identical to the human homologue (GenBank Accession No. BC000348.1) over a region spanning 319 nucleotides of SEQ ID NO:25. In addition, the zebrafish 60S Ribosomal Protein L35 is 92% identical and 95% similar to the human protein (GenBank Accession No. NP_(—)009140.1) over a region spanning 123 amino acids of SEQ ID NO:26.

[0602] The zebrafish 60S Ribosomal Protein L35 mutants display an inflated ventricle, have a head and eyes that are at least 50% smaller than those of identically aged wild-type zebrafish by 48 hours of development, and have blurred somite boundaries.

[0603] The 60S Ribosomal Protein L44 Gene

[0604] We isolated zebrafish mutants containing a viral insertion in the 60S Ribosomal Protein L44 gene (described in Amsterdam et al. (Genes & Dev. 13:2713-2724, 1999)). We determined the viral insertion to be approximately between nucleotides 195 and 196 of the sequence of SEQ ID NO:27. Late in day one of development, these mutants have an enlarged brain ventricle and the yolk disappears from the yolk sac extension. 60S Ribosomal Protein L44 mutant embryos die by day four or five of development. The zebrafish 60S Ribosomal Protein L44 gene is 85% identical to the Mus musculus homologue (GenBank Accession No. NM_(—)019865.1) over a region spanning 324 nucleotides of SEQ ID NO:27. In addition, the zebrafish 60S Ribosomal Protein L44 gene product is 98% identical and 98% similar to the Mus musculus 60S Ribosomal Protein L44 gene product (GenBank Accession No. NP_(—)063918.1) over the region encompassing amino acids 1-106 of SEQ ID NO:28.

[0605] The CopZ1 Gene

[0606] We isolated zebrafish mutants containing an insertion of an F5 virus approximately between nucleotides 90 and 91 of the CopZ1 gene (SEQ ID NO:29; GenBank Accession No. AB040044). The coding region of this gene spans nucleotides 73 to 606 of SEQ ID NO:29 and the amino acid sequence of the CopZ1 gene product is provided in SEQ ID NO:30.

[0607] CopZ1 mutant zebrafish display a degeneration of the eye, especially in the retinal, pigmented epithelia. However, the neuronal layers of the retina also begin to degenerate starting on day four of development.

[0608] The 215 Gene

[0609] We isolated zebrafish mutants containing an insertion of an F5 virus approximately between nucleotides 294 and 295 of the 215 gene (SEQ ID NO:31). The coding region of the zebrafish 215 gene spans nucleotides 47-622 of SEQ ID NO:31 and the 215 gene product, which is similar to ATP-dependent RNA helicases, contains a DEAD-Box helicase domain between amino acids 24 and 228, as well as a Helicase C domain between amino acids 292 and 363. Furthermore, the zebrafish 215 gene is 81% identical to the human gene encoding the KIAA1595 protein (GenBank Accession Nos. AB046815 and BAB13421.1) over a region spanning 188 nucleotides of SEQ ID NO:31 and the gene product is 78% identical and 88% similar to the gene product of this human gene over a region spanning 393 amino acids of SEQ ID NO:32. The zebrafish 215 gene product is also 77% identical and 86% similar to the Mus musculus AK012782 gene product (GenBank Accession No. BAB28466.1) over a region spanning 529 amino acids of SEQ ID NO:32.

[0610] By day three of development, these mutants have eyes that are at least 75% smaller than those of three day-old wild-type zebrafish, a jaw that is at least 75% reduced when compared to that of a three day-old wild type zebrafish, and display general underdevelopment. In addition, using Alcian blue staining, we observed a bent ceratohyal.

[0611] The 307 Gene

[0612] We isolated zebrafish mutants containing an insertion of an F5 virus approximately at nucleotide 176 of the 307 gene (SEQ ID NO:33). The coding region of the 307 gene may begin either at nucleotide 333 or 339 of SEQ ID NO:33. The zebrafish 307 gene is 94% identical over a region spanning 34 nucleic acids of SEQ ID NO:33, and the 307 gene product is 54% identical and 68% similar over a stretch of 199 amino acids of SEQ ID NO:34, to human beta 1,3 glucuronyl transferase (GenBank Accession No. AB009598) and is required for the formation of cartilage and/or jaw structures. In addition, zebrafish containing a mutation in the 307 gene have a mandibular arch that does not extend anteriorly and have slightly misshapen branchial arches 3-7.

[0613] The 572 Gene

[0614] We isolated zebrafish mutants containing an insertion of an F5 virus approximately at nucleotide 277 of the 572 gene. The coding region of the 572 gene spans nucleotides 48-701 of SEQ ID NO:35. The 572 gene product is 37% identical to the human FLJ20508 gene product (GenBank Accession No. NP_(—)060320.1) over a stretch of 196 amino acids of SEQ ID NO:36. By day four of development, mutants in the 572 gene have shorter jaw and branchial arches which are fragmented and/or hard to see.

[0615] The 1116A Gene

[0616] We isolated zebrafish mutants containing an insertion of a GT virus approximately at nucleotide 135 of the 1116A gene (SEQ ID NO:37). The coding region of the 1116A gene spans nucleotides 33 to 578 of SEQ ID NO:37. The 1116A gene product is 42% identical to the human NIH_MGC_(—)93 cDNA clone (GenBank Accession No. BG287661) over a stretch of 191 amino acids of SEQ ID NO:38. By day three of development, we observed a failure of the jaw to develop in 1116A mutants, based on Alcian blue staining.

[0617] The 1548 Gene

[0618] We isolated zebrafish mutants containing an insertion of a GT virus approximately at nucleotide 85 of the 1548 gene (SEQ ID NO:39). The coding region of the 1548 gene spans nucleotides 99 to 2990 of SEQ ID NO:39. The zebrafish 1548 gene is 78% identical to the human gene encoding the DKFZP434B168 protein (GenBank Accession No. NM_(—)015434.1) over a stretch of 503 nucleotides of SEQ ID NO:39, and the nucleic acid sequence encoding the human protein (GenBank Accession No. NP_(—)056249.1) is 76% identical and 87% similar to the zebrafish 1548 gene over 963 nucleotides of SEQ ID NO:40. Zebrafish containing a 1548 mutation have eyes that are slightly smaller than those of identically aged wild-type zebrafish, an abnormal head shape, and edema around the eyes and heart. In addition, these mutants appear to have thicker pectoral fins and jaws. Furthermore, by Alcian blue staining, these mutants have an added structure attached to the parachordal in the neurocranium. While these phenotypes are visible by day three of development, they are more apparent by day five.

[0619] The Casein Kinase 1 α Gene

[0620] We isolated zebrafish mutants containing an insertion of a GT virus approximately between nucleotides 730 and 731 of the Casein Kinase 1 a gene (SEQ ID NO:41). The coding region of the zebrafish Casein Kinase 1 α gene spans nucleotides 505 to 1479 of SEQ ID NO:41. The zebrafish Casein Kinase 1 α gene is 84% identical to the Gallus gallus homologue (GenBank Accession No. AF042862) over a region spanning 976 nucleotides, and is 82% identical to the Homo sapiens Casein Kinase 1 α and clone PP2685 (GenBank Accession Nos. XM_(—)046995.1 and AF218004, respectively) over a region spanning 946 nucleotides, of SEQ ID NO:41. In addition, the zebrafish Casein Kinase 1 α gene product is 99% identical to various vertebrate homologues including the Gallus gallus (GenBank Accession Nos. AF042862 Y08817, and U80822), Rattus norvegicus (GenBank Accession No. U77582), and Homo sapiens (Genbank Accession Nos. XP_(—)046994.1, XP_(—)011309.3, and XP_(—)046996.1), over a region spanning 324 to 325 amino acids of SEQ ID NO:42.

[0621] The mutant phenotype indicates that this gene is required for the formation of cartilage and/or jaw structures. By day three of development, zebrafish mutant for the Casein Kinase 1 α gene have retarded development of the pectoral fins and some of these fins are misshapen. In addition, Alcian blue staining shows that the cartilage of the fins, branchial arches, and jaw is wrinkled.

[0622] The Nodal-Related (Squint) Gene

[0623] We isolated zebrafish mutants containing a viral insertion in the Nodal-Related (Squint) gene (Feldman et al., Nature 395:181-185, 1998; GenBank Accession No. AF056327). We determined that our Nodal-Related (Squint) mutant has a GT virus inserted approximately at nucleotide 654 of SEQ ID NO:43 (GenBank Accession No. AF002218). This location is equivalent to nucleotide 526 of GenBank Accession No. AF056327. The coding region of the zebrafish Nodal-Related (Squint) gene spans nucleotides 177-1355 of SEQ ID NO:43. The Squint gene is 43% identical and 61% similar to the Xenopus laevis Xnr5 gene (GenBank Accession No. BAB18971.1) over a region spanning 355 amino acids, 42% identical and 60% similar to the Xenopus laevis Xnr-2 gene (Genbank Accession No. AAA97393.1) over a region spanning 381 amino acids, 41% identical and 61% similar to the Xenopus laevis Xnr-6 gene (GenBank Accession No. BAB18972.1) over a region spanning 348 amino acids, 41% identical and 60% similar to the Xenopus laevis Xnr-1 gene (GenBank Accession No. AAA97392.1) over a region spanning 367 amino acids, 35% identical and 48% similar to the Homo sapiens Nodal-Related Protein (GenBank Accession No. BAB62524.1) over a region spanning 346 amino acids, and 33% identical and 48% similar to the Mus musculus Nodal-Related or Squint Protein (GenBank Accession No. NP_(—)038639.1) over a region spanning 344 amino acids, of SEQ ID NO:44.

[0624] The Smoothened Gene

[0625] We isolated zebrafish mutants containing a viral insertion in the Smoothened gene (described in Chen et al. (Development 128:2385-2396, 2001); GenBank Accession No. AY029808.1). In the Smoothened mutant zebrafish, we determined that, in the hi 229 allele, an F5 virus inserted approximately at nucleotide 271, and that, in the hi 1640 allele, a GT virus inserted approximately at nucleotide 600, of SEQ ID NO:45. The coding region of the Smoothened gene spans nucleotides 383-2815 of SEQ ID NO:45 and the amino acid sequence of the Smoothened gene product is provided in SEQ ID NO:46.

[0626] Mutations in this gene affect the central nervous system, resulting in a reduction in the number of neurons. Fewer primary motoneurons are present and their axons do not extend correctly. In addition, no secondary motoneurons can be observed in smoothened mutants. The forebrain and midbrain commissures do not form and the optic nerves fail to reach and cross the midline. Further phenotypes observed in smoothened mutants are abnormalities in body shape/axial structures (the body is curved ventrally, the floorplate is reduced, the horizontal myoseptum is missing, the somites are U-shaped, not V-shaped, and mild cyclopia is observed), in cartilaginous structures (the jaw, branchial arches, and pectoral fins are absent), and in muscles (a lack of adaxial muscle tissue, slow muscle fibers, and muscle pioneer cells is observed).

[0627] The 429 Gene

[0628] We isolated zebrafish mutants containing an insertion of an F5 virus approximately between nucleotides 182 and 183 of the 429 gene (SEQ ID NO:47). The coding region of the 429 gene spans nucleotides 43 to 2301 of SEQ ID NO:47. These mutants have a very small liver with no visible circulation, no gall bladder, no pancreas, and the gut is underdeveloped. These phenotypes begin to appear by day three of development and are easily discernible by day five of development. The zebrafish 429 gene product is 53% identical and 70% similar to a human protein (GenBank Accession No. NP_(—)055203.1) over a region spanning 765 amino acids of SEQ ID NO:48 and is 53% identical and 68% similar to the Mus musculus bM282D4.5 protein (GenBank Accession No. CAC42185.1) protein over a region spanning 771 amino acids of SEQ ID NO:48.

[0629] The 428 Gene

[0630] We isolated zebrafish mutants containing an insertion of a GT virus approximately at nucleotide 187 of the 428 gene (SEQ ID NO:49). The coding region of the 428 gene spans nucleotides 44-629 of SEQ ID NO:49. This gene is 85% identical to the Mus musculus 2810405O22Rik sequence (GenBank Accession No. NM_(—)026042.1) over a region spanning 137 nucleotides of SEQ ID NO:49, and is 89% identical to the Homo sapiens DKFZp434H247 protein (GenBank Accession Nos. XM_(—)046121.1 and AL137304.1) over a region spanning 68 nucleotides of SEQ ID NO:49. In addition, the 428 gene product is 62% identical and 71% similar to the Homo sapiens DKFZp434H247 protein (GenBank Accession Nos. CAB70687.1 and BAA91147.1) over a region spanning 170 amino acids of SEQ ID NO:50.

[0631] Zebrafish mutant for the 428 gene have more muscle septa and striations than identically aged wild-type zebrafish. In addition, by day five of development these zebrafish have necrosis of the brain.

[0632] The Spinster Gene

[0633] We isolated zebrafish mutants containing a viral insertion in the spinster gene. These mutants are the result of an SGF virus inserted several kilobases, e.g., 2 to 5 kb, upstream of the coding region, which spans nucleotides 209-1729 of the Spinster gene (SEQ ID NO:51). The zebrafish Spinster gene is 83% identical to the human clone IMAGE:3627317 sequence (GenBank Accession No. BC006156) over a region spanning 81 nucleotides of SEQ ID NO:51. In addition, the zebrafish spinster gene product is 64% identical and 75% similar to the Homo sapiens (GenBank Accession Nos. NP_(—)114427.1, AAG43830.1, and AAH08325.1) or Mus musculus (GenBank Accession Nos. NP_(—)076201.1 and AAG43831.1) Spinster-Like Proteins over a region spanning 527 or 528 amino acids, respectively, of SEQ ID NO:52.

[0634] Zebrafish mutant for this gene show a degeneration of the yolk by day two of development, resulting in a gradual death of the embryo.

[0635] The Glypican-6 or Knypek Gene

[0636] We isolated zebrafish mutants containing an insertion of a GT virus approximately at nucleotide 1054 (hi 1688 allele) or nucleotide 133 (hi 1934 allele) of the Glypican-6 or Knypek gene (SEQ ID NO:53). The coding region of the zebrafish Glypican-6 or Knypek gene spans nucleotides 302-1972 of SEQ ID NO:53. The zebrafish Glypican-6 or Knypek gene (GenBank Accession No. AF354754) is 87% identical to the Homo sapiens Glypican 4 gene (GenBank Accession No. XM_(—)010339.3) over a region spanning 86 nucleotides, and is 82% identical to the Homo sapiens Glypican 6 gene (GenBank Accession Nos. XM_(—)015995.2 and AF105267.1) over a region spanning 133 nucleotides, of SEQ ID NO:53. In addition, the zebrafish Glypican-6 or Knypek gene product is 56% identical and 75% similar to the Mus musculus Glypican 6 Protein (GenBank Accession No. AF105268_(—)1) over a region spanning 541 amino acids, and is 55% identical and 74% similar to the Homo sapiens Glypican 6 Protein (GenBank Acession Nos. AF111178_(—)1 and AF105267_(—)1) over a region spanning 542 amino acids, of SEQ ID NO:54. Furthermore, the zebrafish Glypican-6 or Knypek gene product is 58% identical and 72% similar to the Mus musculus Glypican 4 Protein (GenBank Accession No. NP_(—)032176.1) over a region spanning 533 amino acids, and is 56% identical and 70% similar to the Homo sapiens Glypican 4 Protein over a region spanning 550 amino acids, of SEQ ID NO:54.

[0637] By day one of development, zebrafish containing a mutation in the Glypican-6 or Knypek gene have a shortened tail and U-shaped somites.

[0638] The Ribonucleotide Reductase Protein R1 Class 1 Gene

[0639] We isolated zebrafish mutants (hi 318 and hi 2769A) containing a viral insertion in the Ribonucleotide Reductase Protein R1 Class 1 gene (GenBank Accession No. U57964). The hi 318 allele has an insertion of an F5 virus, and the hi 2769A allele has an insertion of a GT virus, approximately between nucleotides 147 and 148 of SEQ ID NO:55. The coding region of the zebrafish Ribonucleotide Reductase Protein R1 Class 1 gene spans nucleotides 131 to 2512 of SEQ ID NO:55.

[0640] Zebrafish having a mutation in this gene have a bent, convex body shape. In addition, these mutants display transient brain and eye necrosis between 24 and 48 hours of development.

[0641] The Kinesin-Related Motor Protein EG5 Gene

[0642] We isolated zebrafish mutants (alleles hi 486 and hi 3112A) containing a viral insertion in the Kinesin-Related Motor Protein EG5 gene. The hi 486 allele contains an insertion of an F5 virus, ad the hi 3112A allele contains an insertion of a GT virus, approximately between nucleotides 50 and 51 of SEQ ID NO:57. This gene is 80% identical to the Xenopus laevis gene encoding a kinesin-like protein (GenBank Accession No. X71864.1) over a region spanning 538 nucleotides of SEQ ID NO:57. The zebrafish Kinesin-Related Motor Protein EG5 gene is also 80% identical to the Mus musculus Kinesin-Related Mitotic Motor Protein gene (GenBank Accession No. AJ223293.1) over 320 nucleotides, and is 80% identical to the Homo sapiens Kinesin-Like 1 gene (GenBank Accession No. XM_(—)005889.4) over 310 nucleotides, of SEQ ID NO:57. Furthermore, the zebrafish Kinesin-Related Motor Protein EG5 gene product is 55% identical and 71% similar to the Xenopus laevis homologue (GenBank Accession Nos. P28025, A40264, and CAA37950.1) over a region spanning 948 amino acids of SEQ ID NO:58. This protein is also 50% identical and 67% similar to the Homo sapiens Kinesis-Like Spindle Protein HSKP (GenBank Accession Nos. NP_(—)004514.2, XP_(—)051151.1, XP_(—)051152.1, XP_(—)005889.3, G02157, and AAA86132.1), and the Homo sapiens Kinesin-Related Protein EG5 (Genbank Accession Nos. P52732 and CAA59449.1), over a region spanning 948 amino acids of SEQ ID NO:58.

[0643] While the mutant phenotype is not 100% penetrant, zebrafish mutant for the Kinesin-Related Motor Protein EG5 gene generally have bent bodies. In addition, these mutant embryos display elevated levels of apoptotic cells on their surface by 48 hours of development.

[0644] The 459 Gene

[0645] We isolated zebrafish mutants containing an insertion of an F5 virus approximately at nucleotide 210 of the 459 nucleic acid sequence (SEQ ID NO:59). The zebrafish 459 nucleic acid sequence is 74% identical to the Xenopus laevis clone IMAGE:3744559 (GenBank Accession No. BG016943.1) over a region spanning 415 nucleotides of SEQ ID NO:59. In addition, this zebrafish nucleic acid sequence is 73% identical to the Mus musculus clone 4933412L17 (GenBank Accession No. AK016794.1) over a region spanning 584 nucleotides of SEQ ID NO:59. In addition, the polypeptide encoded by the 459 nucleic acid sequence is 72% identical and 90% similar to the polypeptide encoded that the Xenopus laevis IMAGE:3744559 clone (GenBank Accession No. BG016943.1) over a region spanning 233 amino acids of SEQ ID NO:60 and is 80% identical and 92% similar to the Mus musculus IMAGE 1245647 clone (GenBank Accession No. AA797813.1) polypeptide over a region spanning 200 amino acids of SEQ ID NO:60.

[0646] Zebrafish embryos having a mutation in a 459 nucleic acid sequence show apoptosis in the CNS on day one of development (FIG. 33A) and have a curved body by day two of development (FIGS. 33B and 81). In contrast to wild-type embryos of similar age, by day four of development, 459 mutant embryos have kidney tubules that are swollen into cysts (FIGS. 82, 83A, and 83B). Using a polyclonal antibody against the C-terminus of the polypeptide encoded by a 459 nucleic acid sequence, we observed that the 459 polypeptide is localized to the apical surface of the epithelial cells lining the kidney tubule (FIG. 84).

[0647] As described below, 459 amino acid and nucleic acid sequences may be used to identify drug targets and may be used to diagnose, prevent, and treat kidney and proliferative disorders using the exemplary methods provided herein.

[0648] The Wnt5 (Pipetail) Gene

[0649] We isolated zebrafish mutants (alleles hi 1780B and 2735B) containing a viral insertion in the Wnt5 (pipetail) gene (GenBank Accession No. U51268.1). The 1780B allele has an insertion of a GT virus approximately at nucleotide 397, and the 2735B allele has an insertion of a GT virus approximately between nucleotides 530 and 531, of SEQ ID NO:61. The coding region of the Wnt5 (pipetail) gene spans nucleotides 190-1281 of SEQ ID NO:61 and the amino acid sequence of the Wnt5 (pipetail) gene product is provided in SEQ ID NO:62. These mutants have a truncated tail; however, the extent of the truncation is variable.

[0650] The Aryl Hydrocarbon Receptor Nuclear Translocator 2A Gene

[0651] We isolated zebrafish mutants (alleles hi 1715 and hi 2639C) containing a viral insertion in the Aryl Hydrocarbon Receptor Nuclear Translocator 2A gene (GenBank Accession No. AF155066). The hi 1715 allele has a GT virus inserted approximately at nucleotide 229, and the 2639C allele has a GT virus inserted approximately at nucleotide 240 of SEQ ID NO:63. The amino acid sequence of the zebrafish Aryl Hydrocarbon Receptor Nuclear Translocator 2A gene product is provided in SEQ ID NO:64.

[0652] Zebrafish containing a mutation in this gene show very little motility and a minimal tap response by day five of development.

[0653] The Vesicular Integral-Membrane Protein VIP 36 Gene

[0654] We isolated zebrafish mutants containing an insertion of an F5 virus approximately between nucleotides 219 and 220 of the Vesicular Integral-Membrane Protein VIP 36 gene (SEQ ID NO:65). The coding region of this zebrafish gene spans nucleotides 103 to 1110 of SEQ ID NO:65. The Vesicular Integral-Membrane Protein VIP 36 gene is 76% identical to a human endoplasmic reticulum glycoprotein gene (GenBank Accession No. AAH00347.1) over a region spanning 235 nucleotides, and is 73% identical to this human protein over a region spanning 271 nucleotides, of SEQ ID NO:65. In addition, the zebrafish Vesicular Integral-Membrane Protein VIP 36 gene product is 48% identical and 66% similar to the Canis familiaris homologue over a region spanning 340 amino acids, and is 61% identical and 78% similar to the human DKFZp564L2423 protein (GenBank Accession Nos. NP_(—)110432.1, CAB66552.2, AAH00347.1, AAH05822.1, and AAH05862.1) over a region spanning 311 amino acids of SEQ ID NO:66.

[0655] These mutants are touch insensitive at day five of development.

[0656] The 299 Gene

[0657] We isolated zebrafish mutants containing an insertion of an F5 virus approximately between nucleotides 47 and 48 of the 299 gene (SEQ ID NO:67). The zebrafish 299 gene is 84% identical to a human putative nucleotide binding protein (GenBank Accession No. BC001024) over a region spanning 89 nucleotides of SEQ ID NO:67. In addition, the zebrafish 299 protein is 44% identical and 63% similar to a Homo sapiens estradiol-induced protein (GenBank Accession Nos. AAH01024.1, XP_(—)003213.4, and BAB55169.1) over a region spanning 563 amino acids of SEQ ID NO:68.

[0658] Zebrafish mutant for this gene have some apoptosis in the eye and brain, they lack a jaw, branchial arches, and have small fins by the end of day four of development.

[0659] The 994 Gene

[0660] We isolated zebrafish mutants containing an insertion of a GT virus approximately between nucleotides 66 and 67 of the 994 gene (SEQ ID NO:69). The coding region of the 994 gene may begin at nucleotide 5 or 80 of SEQ ID NO:69. In addition, the zebrafish 994 gene is 35% identical and 49% similar to a Homo sapiens protein (GenBank Accession No. BAB15418.1) over a region spanning 490 amino acids of SEQ ID NO:70.

[0661] These mutants have small eyes and a small head. The eyes and head are smaller than wild-type. In addition, zebrafish containing a mutation in the 994 gene have abnormal jaws and arches, as well as an underdeveloped stomach.

[0662] The 1373 Gene

[0663] We isolated zebrafish mutants (alleles hi 1373 and hi 3245) containing a viral insertion in the 1373 gene. Both the hi 1373 and the hi 3245 alleles have a GT virus inserted approximately between nucleotides 118 and 119 of SEQ ID NO:71. In addition, the coding region of the 1373 gene spans nucleotides 67 to 396 of SEQ ID NO:71. The zebrafish 1373 gene is 82% identical to the Mus musculus Riken Library clone 1110007B08 (GenBank Accession No. AK003520) and is 84% identical to gene encoding the Homo sapiens MGC1346 protein (GenBank Accession No. XM_(—)039445.1) over a region spanning 333 nucleotides of SEQ ID NO:71. Furthermore, the zebrafish 1373 protein is 91% identical and 95% similar to the Arabidopsis thaliana protein F23F1.8 (GenBank Accession No. AAF82203.1) over a region spanning 104 amino acids of SEQ ID NO:72. The zebrafish 1373 protein is also 100% identical to the Homo sapiens MGC 1346 protein (GenBank Accession Nos. NP_(—)116147.1, XP_(—)039445.1, and CAB62939.1) and its Mus musculus homolog (GenBank Accession No. BAB22833.1) over a region spanning 110 amino acids of SEQ ID NO:72.

[0664] Mutants in this gene display brain and eye necrosis, constriction of the anterior end of the yolk sac extension, and body curvature by day two of development.

[0665] The Denticleless Gene

[0666] We isolated zebrafish mutants containing an insertion of an F5 virus approximately between nucleotides 307 and 308 of the Denticleless gene (SEQ ID NO:73). The coding region of the zebrafish Denticleless gene spans nucleotides 34-1974 of SEQ ID NO:73. The zebrafish Denticleless gene is 79% identical to the human gene encoding the L2DTL protein (GenBank Accession No. NM_(—)016448.1) over a region spanning 181 nucleotides of SEQ ID NO:73. The zebrafish Denticleless protein contains four WD-40 domains, which are located between amino acids 88-125, 128-168, 301-339, and 351-383 of SEQ ID NO:74. In addition, this protein is 46% identical and 58% similar to a Homo sapiens protein (GenBank Acession No. BAA91355.1) over a region spanning 729 amino acids of SEQ ID NO:74. Furthermore, the zebrafish Denticleless protein is 66% identical and 77% similar to the N-terminal 386 amino acids of this human protein.

[0667] Mutants in this gene have a body that bends upwards, brain necrosis extending down the neural tube, and a wrinkled yolk sac on day one of development. By day two of development, the body shape is convex, the somites are wrinkled, the eye shape is irregular, and the yolk sac extension is absent.

[0668] The Ribonucleotide Reductase Protein R2 Gene

[0669] We isolated zebrafish mutants (including alleles hi 688 and hi 1706) containing an insertion of a GT virus in the Ribonucleotide Reductase Protein R2 gene (SEQ ID NO:75; GenBank Accession No. U57965). The hi 688 allele has the virus inserted approximately at nucleotide 137 of SEQ ID NO:75, which corresponds to position 360 of an alternatively spliced form of this gene (GenBank Accession No. AW280665). Similarly, the hi 1706 allele has the virus inserted approximately at nucleotide 342 of GenBank Accession No. AW280665. An additional allele has the virus inserted at nucleotide 337 of GenBank Accession No. AW280665. The coding region of the Ribonucleotide Reductase Protein R2 gene spans nucleotides 130-1290 of SEQ ID NO:75 and starts and nucleotide 352 of GenBank Accession No. AW280665. The amino acid sequence of the zebrafish Ribonucleotide Reductase Protein R2 gene product is provided in SEQ ID NO:76.

[0670] Zebrafish mutant for this gene show necrosis in the CNS and the entire body curls upward by day two of development.

[0671] The TCP-1 Alpha Gene

[0672] We isolated zebrafish mutants (alleles hi 491 and hi 1907) containing a viral insertion in the TCP-1 Alpha gene (SEQ ID NO:77; GenBank Accession No. AF143493, with 59 bp added from GenBank Accession No. AW175148). The hi 491 allele results from an insertion of an F5 virus 140 bp upstream of the gene and the hi 1907 allele results from an insertion of an F5 virus approximately between nucleotides 130 and 131 of SEQ ID NO:77. The coding region of the zebrafish TCP-1 Alpha gene spans nucleotides 64-1734 of SEQ ID NO:77 and the amino acid sequence is provided in SEQ ID NO:78.

[0673] Zebrafish mutant for the TCP-1 Alpha gene have eyes and a head that are at least 50% smaller than wild-type by day four of development.

[0674] The Telomeric Repeat Factor 2 Gene

[0675] We isolated zebrafish mutants containing an insertion of a GT virus approximately between nucleotides 529 and 530 of the Telomeric Repeat Factor 2 gene (SEQ ID NO:79). The coding region of the zebrafish Telomeric Repeat Factor 2 gene spans nucleotides 391-2112 of SEQ ID NO:79. In addition the protein encoded by this zebrafish gene is 31% identical and 45% similar to the human Telomeric Repeat Factor 2 protein (GenBank Accession No. U95970) over a region spanning 354 nucleotides, and is 32% identical and 46% similar to this human protein over a region spanning 200 nucleotides, of SEQ ID NO:79. The amino acid sequence of the zebrafish Telomeric Repeat Factor 2 gene product is provided in SEQ ID NO:80.

[0676] Zebrafish mutant for this gene have some necrosis in the brain and eye by day two of development.

[0677] The SIL Gene

[0678] We isolated zebrafish mutants containing an insertion of a GT virus approximately between nucleotide 273 and 274 of the SIL gene. The coding region of the zebrafish SIL gene spans nucleotides 274 to 4059 of SEQ ID NO:81. The zebrafish SIL gene is 79% identical to the DNA sequence encoding the Homo sapiens Tall (SCL) Interrupting Locus (GenBank Accession Nos. NP_(—)003026.1, AAA60550.1, and AAK51418.1) over a region spanning 77 nucleotides, and is 75% identical to this human gene over a region spanning 96 nucleotides, of SEQ ID NO:81. In addition, the zebrafish SIL gene product is 36% identical and 51% similar to the Mus musculus Tall Interrupting Locus gene product (GenBank Accession Nos. NP_(—)033211.1, AAC52386.1, and CAC14001.1) over a region spanning 1348 amino acids of SEQ ID NO:82. In addition, this zebrafish gene product is also 36% identical and 50% similar to the protein encoded by the Homo sapiens SCL Interrupting Locus gene (GenBank Accession Nos. NP_(—)003026.1, AAA60550.1, and AAK51418.1) over a region spanning 1363 amino acids of SEQ ID NO:82.

[0679] Zebrafish mutant for the 1262 gene have brain necrosis, a head that is at least 33% smaller than wild-type, and a bent body by day two of development. By day three of development, these zebrafish have a motility defect.

[0680] The U1 Small Nuclear Ribonucleoprotein C Gene

[0681] We isolated zebrafish mutants containing an insertion of a GT virus approximately between nucleotides 52 and 53 of the U1 Small Nuclear Ribonucleoprotein C gene (SEQ ID NO:83). The coding region of the zebrafish U1 Small Nuclear Ribonucleoprotein C gene spans nucleotides 45 to 521 of SEQ ID NO:83. In addition, the zebrafish U1 Small Nuclear Ribonucleoprotein C gene is 85% identical to the human homologue (GenBank Accession No. XM_(—)004292.3) over a region spanning 170 nucleotides of SEQ ID NO:83. Furthermore, this zebrafish gene is 83% identical and 84% similar to the Xenopus laevis homologue (GenBank Accession No. CAA45354.1) over a region spanning 159 amino acids, and is 84% identical and 85% similar to the human homologue (GenBank Accession Nos. NP_(—)003084.1, XP_(—)043295.1, and CAA31037.1) over a region spanning 159 amino acids, of SEQ ID NO:84.

[0682] By day two of development, zebrafish mutant for this gene have a body that is curved upwards, some brain necrosis, motility problems, and smaller otoliths. Furthermore, by day three of development, U1 Small Nuclear Ribonucleoprotein C gene mutant zebrafish have small and irregular eyes, retarded fins, and more pigment in the hind-brain.

[0683] The Ski Interacting Protein (SKIP) Gene

[0684] We isolated zebrafish mutants containing an insertion of an F5 virus approximately 1.2 kb upstream of the beginning of the zebrafish Ski Interacting Protein (SKIP) gene. The coding region of the zebrafish Ski Interacting Protein (SKIP) gene spans nucleotides 19 to 1626 of SEQ ID NO:85. The nucleic acid sequence encoding zebrafish Ski Interacting Protein is 79% identical to that of the human homologue (GenBank Accession No. U51432) over a region spanning 812 nucleotides of SEQ ID NO:85. In addition, the amino acid sequence of the protein encoded by the zebrafish nucleic acid sequence is 84% identical and 94% similar to that of the human homologue (GenBank Accession No. NP_(—)036377.1) over a region spanning 536 amino acids of SEQ ID NO:86.

[0685] Zebrafish mutant for this gene display a curved body, extensive brain necrosis, a lack of brain divisions, and abnormal mobility by day two of development.

[0686] The 297 Gene

[0687] We isolated zebrafish mutants containing an insertion of an F5 virus approximately at nucleotide 74 of the 297 gene (SEQ ID NO:87). The coding region of the 297 gene may span either from nucleotide 108 to 2153 or from nucleotide 288 to 2153 of SEQ ID NO:87. The 297 gene is 80% identical to the human gene encoding the FLJ10498 protein (GenBank Accession No. XM_(—)011142.2) over a region spanning 173 nucleotides of SEQ ID NO:87. Furthermore, the protein encoded by the 297 gene is 77% identical and 89% similar to the FLJ10498 or AK001360 protein (GenBank Accession Nos: NP_(—)060585.1 and BAA91648.1) over a region spanning 624 amino acids of SEQ ID NO:88.

[0688] Zebrafish mutant for the 297 gene show brain necrosis, movement abnormalities, less eye pigment, and a tail that is kinked down. In addition, these mutants are missing the branchial arches, the ethmoid plate, and most of the jaw by day three of development.

[0689] The TCP-1 Complex Gamma Chain Gene

[0690] We isolated zebrafish mutants (alleles hi 383A and hi 1867) containing a viral insertion in the TCP-1 Complex Gamma Chain gene. The hi 383A allele contains an F5 virus and the hi 1867 allele contains a GT virus inserted approximately between nucleotides 75 and 76 of SEQ ID NO:89. The coding region of the zebrafish TCP-1 Complex Gamma Chain gene spans nucleotides 51 to 1676 of SEQ ID NO:89. The TCP-1 Complex Gamma Chain gene is 76% identical to the Xenopus laevis Chaperonin Subunit CCT Gamma gene (GenBank Accession No. U37062) over a region spanning 1598 nucleotides, and is 75% identical to the Homo sapiens Chaperonin Containing TCP1, subunit 3 (gamma) gene (GenBank Accession No. AAH06501.1) over a region spanning 1573 nucleotides, of SEQ ID NO:89. In addition, this zebrafish protein is 87% identical and 95% similar to the Xenopus laevis homologue over a region spanning 529 amino acids, and is 84% identical and 92% similar to the Homo sapiens homologue over a region spanning 541 amino acids, of SEQ ID NO:90.

[0691] Zebrafish having a mutation in this gene have a thinner yolk sac extension.

[0692] The Small Nuclear Ribonucleoprotein D1 Gene

[0693] We isolated zebrafish mutants containing an insertion of an F5 virus approximately between nucleotides 76 and 77 of the Small Nuclear Ribonucleoprotein D1 gene (SEQ ID NO:91). The coding region of the zebrafish Small Nuclear Ribonucleoprotein D1 gene spans nucleotides 63-420 of SEQ ID NO:91. The zebrafish Small Nuclear Ribonucleoprotein D1 gene is 88% identical to the Mus musculus homologue (GenBank Accession No. NM_(—)009226.1), and is 87% identical to the Homo sapiens homologue (GenBank Accession No. XM_(—)032156.1), over a region spanning 152 nucleotides of SEQ ID NO:91. In addition, the amino acid sequence of the zebrafish Small Nuclear Ribonucleoprotein D1 (SEQ ID NO:92) is 97% identical and 98% similar to the Homo sapiens (GenBank Accession Nos. NP_(—)008869.1, XP_(—)008813.1, XP_(—)032156.1, P13641, A27668, AAA36620.1, and AAH01721.1) or Mus musculus (GenBank Accession Nos. NP_(—)033252.1, AAA96493.1, BAB24092.1, BAB25178.1, BAB27628.1, and BAB28635.1) homologues.

[0694] Zebrafish mutant for this gene have an inflated hind-brain and increased necrosis in the CNS, particularly in the eye, when compared to wild-type zebrafish.

[0695] The DNA Polymerase Epsilon Subunit B Gene

[0696] We isolated zebrafish mutants (alleles hi 783 and hi 1703) containing a viral insertion in the DNA Polymerase Epsilon Subunit B gene. The hi 785 allele contains an insertion of an F5 virus approximately at nucleotide 929, and the hi 1703 allele contains an insertion of a GT virus between nucleotides 1161 and 1162, of SEQ ID NO:93. The coding region of the zebrafish DNA Polymerase Epsilon Subunit B gene spans nucleotides 32 to 1612 of SEQ ID NO:93. The DNA Polymerase Epsilon Subunit B gene is 70% identical to the Mus musculus homologue (GenBank Accession No. AF036898) over a region spanning 1038 nucleotides, is 71% identical to the Xenopus laevis homologue (GenBank Accession No. AB048257) over a region spanning 541 nucleotides, and is 79% identical to the Homo sapiens homologue (GenBank Accession No. AF036899) over a region spanning 96 nucleotides, of SEQ ID NO:93. In addition the protein encoded by the zebrafish DNA Polymerase Epsilon Subunit B gene is 74% identical and 89% similar to the Xenopus laevis homologue (GenBank Accession No. BAB12726.1) over a region spanning 527 amino acids, is 74% identical and 89% similar to the Mus musculus homologue (GenBank Accession No. AAC40045.1) over a region spanning 522 amino acids, and is 73% identical and 87% similar to the Homo sapiens homologue (GenBank Accession Nos. XP_(—)012327.3, P56282, AAC39610.1, and AAK72254.1) over a region spanning 526 amino acids of SEQ ID NO:94.

[0697] Zebrafish mutant for this gene show severe necrosis of the brain and eye by day two of development.

[0698] The 821-02 Gene

[0699] We isolated zebrafish mutants (alleles hi 821-02 and hi 2144) containing a viral insertion in the 821-02 gene. The hi 821-02 allele contains a GT virus inserted approximately between nucleotides 369 and 370, and the hi 2144 allele contains a GT virus inserted approximately between nucleotides 231 and 232, of SEQ ID NO:95. The coding region of the 821-02 gene spans nucleotides 33 to 1982 of SEQ ID NO:95. The zebrafish 821-02 nucleic acid sequence is 76% identical to that encoding the Homo sapiens D26488 protein (GenBank Accession No. BAA05499.1) over a region spanning 99 nucleotides of SEQ ID NO:95. In addition, the zebrafish 821-02 amino acids sequence is 52% identical and 68% similar to the Homo sapiens D26488 amino acid sequence (GenBank Accession No. BAA05499.1) over a region spanning 683 amino acids of SEQ ID NO:96. Furthermore, the zebrafish 821-02 amino acid sequence contains WD40 repeats at positions 105-145 and 148-185 of SEQ ID NO:96.

[0700] Zebrafish mutant for this gene have extensive apoptosis in the CNS and the eye by 24 to 48 hours into development, as visualized by acridine orange staining.

[0701] The 1045 Gene

[0702] We isolated zebrafish mutants containing an insertion of a GT virus approximately between nucleotides 216 and 344 of the zebrafish 1045 gene (SEQ ID NO:97). The coding region of the zebrafish 1045 gene spans nucleotides 2-1039 of SEQ ID NO:97. In addition, the zebrafish 1045 gene is 75% identical to the Homo sapiens serine/threonine kinase 12 sequence (GenBank Accession No. XM_(—)008569.4) over a region spanning 573 nucleotides, and is 75% identical to the Homo sapiens serine/threonine kinase AIE2 sequence (GenBank Accession No. AF054621.1) over a region spanning 511 nucleotides, of SEQ ID NO:97. Furthermore, the protein encoded by the zebrafish 1045 nucleic acid sequence is 73% identical and 86% similar to the Homo sapiens Aurora/Ip11-Related Kinase 3 protein (GenBank Accession No. BAA76292.1), the Homo sapiens STK13 protein (GenBank Accession No. AAC25618.1), and the Homo sapiens Serine/Threonine Kinase AIE2 protein (GenBank Accession No. AAC 25955.1), over a region spanning 266 amino acids of SEQ ID NO:98. The zebrafish 1045 gene product is also 74% identical and 86% similar to Mus musculus homologues (GenBank Accession Nos. BAA04658.1, AAC12683.1, AAH03261.1, and JC4665) over a region spanning 270 amino acids, and is 76% identical and 88% similar to the Rattus norvegicus AIM-1 protein (GenBank Accession No. BAA23794.1) over a region spanning 265 amino acids, of SEQ ID NO:98.

[0703] Zebrafish mutant for the 1045 gene have severe brain and head necrosis at 24 hours post fertilization.

[0704] The 1055-1 Gene (Zebrafish MAK16 Homologue)

[0705] We isolated zebrafish mutants containing an insertion of a GT virus approximately between nucleotides 167 and 168 of the zebrafish 1055-1 gene (SEQ ID NO:99). The coding region of the 1055-1 gene spans nucleotides 152-1204 of SEQ ID NO:99. In addition, the 1055-1 nucleic acids sequence is 74% identical to the DNA sequence encoding a Homo sapiens RNA binding protein (GenBank Accession No. AF251062.1) over a region spanning 552 nucleotides of SEQ ID NO:99. Furthermore, the protein encoded by the zebrafish 1055-1 gene is 70% identical and 83 to 84% similar to Homo sapiens RNA binding proteins (GenBank Accession Nos. NP_(—)115898.1, AAK34952.1, BAB55134.1, XP_(—)050217.1, and XP_(—)050216.1) over a region spanning 285 amino acids of SEQ ID NO:100.

[0706] Zebrafish mutant for the 1055-1 gene have a misshapen or missing yolk sac extension and a tail that bends down by day two or three of development. As is noted above, the zebrafish 1055-1 gene product is highly homologous (54% identical and 75% similar over 191 amino acids of SEQ ID NO:100) to the Saccharomyces cerevisiae MAK16 protein (GenBank Accession Nos. AAA34752.1 and AAC05007.1). The yeast protein has been shown to be involved in both cell cycle progression and in the biogenesis of 60S ribosomal subunits.

[0707] The Spliceosome Associated Protein 49 Gene

[0708] We isolated zebrafish mutants containing an insertion of a GT virus approximately between nucleotides 53 and 54 of the Spliceosome Associated Protein 49 gene (SEQ ID NO:101). The coding region of this zebrafish gene spans nucleotides 20-1216 of SEQ ID NO:101. In addition, the zebrafish Spliceosome Associated Protein 49 gene is 78% identical to the Homo sapiens Splicing Factor 3b, subunit 4 gene (GenBank Accession No. BC004273) over a region spanning 651 nucleotides of SEQ ID NO:101. Furthermore the protein product encoded by the zebrafish Spliceosome Associated Protein 49 gene is 80% identical and 82% similar to the Homo sapiens Splicing Factor 3b, subunit 4 gene product (GenBank Accession Nos. NP_(—)005841.1, XP_(—)001943.1, XP_(—)051919.1, Q15427, A54964, AAA60300.1, and AAH04273.1) over a region spanning 322 amino acids of SEQ ID NO:102.

[0709] Zebrafish mutant for this gene have tectal necrosis and a bent body by day two of development.

[0710] The DNA Replication Licensing Factor MCM7 Gene

[0711] We isolated zebrafish mutants (alleles hi 1411 and hi 2704) containing a viral insertion in the DNA Replication Licensing Factor MCM7 gene. The hi 1411 allele contains an insertion of a GT virus approximately between nucleotides 121 and 122, and the hi 2704 allele contains an insertion of a GT virus approximately at nucleotide 198, of SEQ ID NO:103. The coding region of this zebrafish gene spans nucleotides 93 to 677 of SEQ ID NO:103. However, the DNA sequence encoding the full-length protein may be longer than that of SEQ ID NO:103. One skilled in the art of molecular biology would readily be able to obtain a longer cDNA sequence for this gene, using standard techniques, once provided with the sequence of SEQ ID NO:103.

[0712] The zebrafish DNA Replication Licensing Factor MCM7 gene is 74% identical to the Homo sapiens P1cdc47 gene (GenBank Accession No. D55716.1) over a region spanning 286 nucleotides of SEQ ID NO:103 In addition, the protein encoded by this zebrafish gene is 75% identical and 85% similar to the Xenopus laevis CDC47-2p protein (GenBank Accession No. AAC60228.1) over a region spanning 192 amino acids, and is 72% identical and 83% similar to the Homo sapiens DNA Replication Licensing Factor MCM7 (GenBank Accession No. P33993) over a region spanning 194 amino acids, of SEQ ID NO:104.

[0713] Zebrafish mutant for this gene have severe necrosis in the eye and in the CNS by late day one/early day two of development.

[0714] The DEAD-Box RNA Helicase (DEAD5 or DEAD19) Gene

[0715] We isolated zebrafish mutants containing an insertion of a GT virus approximately at nucleotide 132 of the zebrafish DEAD-Box RNA Helicase (DEAD5 or DEAD19) nucleic acid sequence (SEQ ID NO:105). The coding region of this zebrafish nucleic acid sequence spans nucleotides 131 to 1592 of SEQ ID NO:105. In addition, the zebrafish protein (SEQ ID NO:106) encoded by the DEAD-Box RNA Helicase (DEAD5 or DEAD19) nucleic acid sequence contains a DEAD Box between amino acids 113 and 318, as well as a helicase C domain between amino acids 357 and 442). Furthermore, the zebrafish DEAD-Box RNA Helicase (DEAD5 or DEAD19) nucleic acid sequence is 81% identical a human homologue (GenBank Accession No. AJ237946) over a region spanning 810 nucleotides, and is 78% identical to the human sequence with GenBank Accession No. NM_(—)007242.2 over a region spanning 1312 nucleotides, of SEQ ID NO:105. The protein encoded by the DEAD-Box RNA Helicase (DEAD5 or DEAD19) nucleic acid sequence is 84% identical and 92% similar to the Homo sapiens DEAD/H Box Polypeptide 19 and DEAD-Box Protein 5 (GenBank Accession Nos. NP_(—)009173.1, XP_(—)028024.1, XP_(—)028026.1, Q9UMR2, AAH03626.1, and CAB52189.1) over a region spanning 487 amino acids of SEQ ID NO:106.

[0716] Zebrafish mutant for this gene show brain necrosis on day one of development and are dead by day two of development.

[0717] The 1581 Gene

[0718] We isolated zebrafish mutants containing an insertion of a GT virus approximately between nucleotides 346 and 347 of the 1581 nucleic acid sequence (SEQ ID NO:107). The coding region of the zebrafish 1581 nucleic acid sequence spans nucleotides 106 to 912 of SEQ ID NO:107. In addition, the zebrafish 1581 polypeptide may have an RNA recognition motif between amino acids 46 and 118 of SEQ ID NO:108.

[0719] The 1581 nucleic acid sequence is 77% identical to a Homo sapiens nucleic acid sequence for a nucleolar protein interacting with the FHA domain of pKi-67 (GenBank Accession No. NM_(—)032390) over a region spanning 165 nucleotides of SEQ ID NO:107. Furthermore, the zebrafish 1581 polypeptide is 48% identical and 62% similar to the Homo sapiens nucleolar phosphoprotein Nopp34 (GenBank Accession Nos. NP_(—)115766.1, XP_(—)037099.1, and BAB41210.1) over a region spanning 273 amino acids of SEQ ID NO:108.

[0720] Zebrafish mutant for the 1581 gene show transient brain and eye necrosis between days one and three of development. By day three of development, these mutant zebrafish have heads and eyes that are 50% smaller than those of identically aged wild-type zebrafish.

[0721] The Cyclin A2 Gene

[0722] We isolated zebrafish mutants containing an insertion of a GT virus approximately at position 374 or 401 of the Cyclin A2 gene (GenBank Accession No. AF234784). The coding region of this zebrafish gene spans nucleotides 250 to 1536 of SEQ ID NO:109 and the amino acid sequence is provided in SEQ ID NO:110.

[0723] Zebrafish mutant for the Cyclin A2 gene display eye and CNS necrosis, no jaw development, and abnormal semicircular canals by day three of development. By day five of development, the head is at least 66% smaller than that of a five day-old wild-type zebrafish.

[0724] The ISWI/SNF2 Gene

[0725] We isolated zebrafish mutants containing an insertion of a GT virus approximately at position 76 of the Imitation Switch ISWI/SNF2 nucleic acid sequence (SEQ ID NO:111). The coding region of this zebrafish gene spans nucleotides 161 to 655 of SEQ ID NO:111. The naturally-occurring zebrafish protein may be considerably longer than the one encoded by SEQ ID NO:111. One skilled in the art of molecular biology would be able to isolate a longer cDNA sequence for the zebrafish ISWI/SNF2 nucleic acid based on the sequence of SEQ ID NO:111, using standard techniques. The zebrafish ISWI/SNF2 nucleic acid sequence is 84% identical to the Xenopus laevis Initiation Switch (ISWI) nucleic acid sequence (GenBank Accession No. AF292095.2) over a region spanning 196 nucleotides, and is 82% identical to the Homo sapiens hSNF2H nucleic acid sequence (GenBank Accession No. AB010882) over a region spanning 224 nucleotides, of SEQ ID NO:111. In addition, the zebrafish ISWI/SNF2 polypeptide is 60% identical and 76% similar to the Xenopus laevis Imitation Switch ISWI polypeptide (GenBank Accession No. AAG01537.2) over a region spanning 145 amino acids, and is 70% identical and 76% similar to the Homo sapiens SWI/SNF related, actin-dependent regulator of chromatin (GenBank Acession Nos. NP_(—)003592.1 and BAA25173.1) over a region spanning 129 amino acids, of SEQ ID NO:112.

[0726] On day three of development, zebrafish mutant for the ISWI/SNF2 gene show extensive necrosis in the eye, in particular on the medial 75% of the inner cell ganglion layer and in the optic tectum. By day four of development the eye is at least 25% smaller that that of a four day-old wild-type zebrafish, and the lower jaw has dropped.

[0727] The Chromosomal Assembly Protein C (XCAP-C) Gene

[0728] We isolated zebrafish mutants containing an insertion of a GT virus approximately between nucleotides 181 and 182 of the zebrafish XCAP-C nucleic acid sequence (SEQ ID NO:113). The coding region of the zebrafish XCAP-C nucleic acid sequence spans nucleotides 192 to 3326 of SEQ ID NO:113. The zebrafish XCAP-C nucleic acid sequence is 78% identical to the Xenopus laevis homologue (GenBank Accession No. U13673.1) over a region spanning 554 nucleotides, is 74% identical to this Xenopus gene over a region spanning 765 nucleotides, and is 72% identical to the Homo sapiens CAP-C nucleic acid sequence (GenBank Accession No. NM_(—)005496.1) over a region spanning 760 nucleotides, of SEQ ID NO:113. In addition, the zebrafish XCAP-C polypeptide sequence is 70% identical and 84% similar to the Xenopus laevis homologue (GenBank Accession Nos. P50532, A55094, and AAA64679.1), and is 65% identical and 81% similar to the Homo sapiens homologue (GenBank Accession Nos. CAB66811.1, NP_(—)005487.1, and BAA73535.1) over a region spanning 979 amino acids of SEQ ID NO:114.

[0729] Zebrafish mutant for this gene show necrosis in the optic tectum, eye, and hind-brain by day two of development.

[0730] The DNA Replication Licensing Factor MCM2 Gene

[0731] We isolated zebrafish mutants (alleles hi 1244 and hi 3205) containing a viral insertion in the DNA Replication Licensing Factor MCM2 nucleic acid sequence. The hi 1244 and hi 3205 alleles are the result of an insertion of a GT virus in the intron preceding nucleotide 399 of SEQ ID NO:115. The coding region of the zebrafish DNA Replication Licensing Factor MCM2 nucleic acid sequence begins with nucleotide 36 of SEQ ID NO:115. In addition, the zebrafish DNA Replication Licensing Factor MCM2 gene is 78% identical to the human homologue (GenBank Accession No. D83987.1) over a region spanning 1164 nucleotides of SEQ ID NO:115. Furthermore, the zebrafish DNA Replication Licensing Factor MCM2 polypeptide is 79% identical and 86% similar to the Homo sapiens MCM2 protein (GenBank Accession Nos. P49736 and BAA12177.1) over a region spanning 893 amino acids of SEQ ID NO:116.

[0732] Zebrafish mutant for this gene have small eyes, necrosis in the optic tectum, and abnormal jaws and branchial arches by day five of development.

[0733] The DNA Replication Licensing Factor MCM3 Gene

[0734] We isolated zebrafish mutants (alleles hi 319 and hi 3068) containing a viral insertion in the DNA Replication Licensing Factor MCM3 nucleic acid sequence. The hi 319 allele is the result of an F5 virus insertion approximately at nucleotide 50, and the hi 3068 allele is the result of a GT virus insertion approximately between nucleotides 75 and 76 of SEQ ID NO:117. In addition, the entire nucleic acid sequence of SEQ ID NO:117 is part of the coding sequence of the DNA Replication Licensing Factor MCM3 gene. The zebrafish DNA Replication Licensing Factor MCM3 nucleic acid sequence is 78% identical to the Mus musculus P1 nucleic acid sequence (GenBank Accession No. X62154.1), and is 77% identical to the Homo sapiens MCM3 nucleic acid sequence (GenBank Accession No. NM_(—)002388.2) over a region spanning 574 nucleotides of SEQ ID NO:117. Furthermore, the zebrafish DNA Replication Licensing Factor MCM3 polypeptide is 83% identical and 93% similar to the Xenopus homologue over a region spanning 178 amino acids, and is 86% identical and 94% similar to the human and mouse homologues (GenBank Accession Nos. NP_(—)002379.2, XP_(—)004096.3, XP_(—)037069.1, XP_(—)037070.1, XP_(—)037068.1, CAA44078.2, CAB75298.1, and AAH01626.1 (human); and CAA44079.1 (mouse)) over a region spanning 167 amino acids, of SEQ ID NO:118.

[0735] Zebrafish mutant for this gene have necrosis in the optic tectum on day two of development. By day three of development, no further necrosis is visible in the optic tectum, but the head and eyes are at least 25% smaller than those of three day-old wild-type zebrafish.

[0736] The Valyl-tRNA Synthase Gene

[0737] We isolated zebrafish mutants containing an insertion of an F5 virus approximately between nucleotides 30 and 31 of the Valyl-tRNA Synthase nucleic acid sequence (SEQ ID NO:119; GenBank Accession No. AF210648). The zebrafish Valyl-tRNA Synthase polypeptide is 67% identical and 75% similar to the Takifugu rubripes homologue (GenBank Accession Nos. P49696 and CAA62967.1) over a region spanning 425 amino acids, and is 52% identical and 62% similar to the Mus musculus homologue (GenBank Accession Nos. ADD26531.1, ADD26532.1, NP_(—)035820.1, Q9Z1Q9, and AAC84151.1) over a region spanning 437 amino acids, of SEQ ID NO:120. This zebrafish polypeptide is also 52% identical and 62% similar to the Homo sapiens homologue (GenBank Accession Nos. CAA41990.1, NP_(—)006286.1, P26640, and AAD21819.1) over a region spanning 440 amino acids of SEQ ID NO:120.

[0738] By day three of development, zebrafish with a mutation in this gene have apoptosis in the brain, a smaller head, and lighter, smaller eyes.

[0739] The 40S Ribosomal Protein S5 Gene

[0740] We isolated zebrafish mutants (alleles 577B (or 577-03) and 1364A) containing a viral insertion in the 40S Ribosomal Protein S5 nucleic acid sequence. The 577B allele is the result of an F5 virus, and the 1364A allele is the result of a GT virus, inserted in an intron between nucleotides 31 and 32 of SEQ ID NO:121. The coding region of the zebrafish 40S Ribosomal Protein S5 nucleic acid sequence spans nucleotides 36 to 645 of SEQ ID NO:121. The zebrafish 40S Ribosomal Protein S5 nucleic acid sequence is 84% identical to that of the human homologue (GenBank Accession Nos. NM_(—)001009.2, XM_(—)034266.1, and XM_(—)034265.1) over a region spanning 593 nucleotides of SEQ ID NO: 121. In addition, the zebrafish 40S Ribosomal Protein S5 is 96% identical and 98% similar to the Mus musculus (GenBank Accession Nos. CAA73041.1, BAB21953.1, BAB26424.1, BAB27113.1, BAB28229.1, BAB28270.1, BAB32115.1, and BAB32203.1) and Homo sapiens (GenBank Accession Nos. NP_(—)001000.2, XP_(—)034266.1, and XP_(—)034265.1) homologues over a region spanning 202 amino acids of SEQ ID NO:122.

[0741] Zebrafish mutant for this gene have a swollen “bubble-brain” and motility problems by day two of development. By day three of development, the brain looks relatively normal, but the motility defect persists.

[0742] The TCP-1 Beta Gene

[0743] We isolated zebrafish mutants (alleles hi 642, hi 1269, and hi 2108) containing a viral insertion in the TCP-1 Beta gene. The hi 642 allele is the result of an F5 virus, and the hi 1269 and hi 2108 alleles are the result of a GT virus, inserted in an intron between nucleotides 63 and 64 of SEQ ID NO:123. The coding region of the TCP-1 Beta nucleic acid sequence includes nucleotides 60-1582 of SEQ ID NO:123. One skilled in the art of molecular biology can isolate additional coding sequence for the zebrafish TCP-1 Beta gene using the sequences provided herein.

[0744] In addition, the zebrafish TCP-1 Beta nucleic acid sequence is 73% identical to the Mus musculus Chaperonin Subunit 2 Beta nucleic acid sequence (GenBank Accession No. BC007470) over a region spanning 1353 nucleotides, and is 74% identical the Homo sapiens chaperonin containing TCPI subunit 2 Beta nucleic acid sequence (GenBank Accession No. XM_(—)006861.4) over a region spanning 831 nucleotides, of SEQ ID NO:123. Furthermore, the polypeptide encoded by the zebrafish TCP-1 Beta nucleic acid sequence is 86% identical and 92% similar to the human homologue (GenBank Accession Nos. NP_(—)006422.1, XP_(—)046041.1, XP_(—)006861.4, P78371, AAC96012.1, and AAC98906.1) over a region spanning 507 amino acids of SEQ ID NO:124.

[0745] Zebrafish mutant for this gene have jaw and cartilage defects and a small head and eyes by day three of development, when compared to identically aged wild-type zebrafish.

[0746] The TCP-1 Eta Gene

[0747] We isolated zebrafish mutants (alleles hi 800A and hi 2191) containing a viral insertion in the TCP-1 Eta gene. The hi 800A allele is the result of an F5 virus insertion, and the hi 2191 allele is the result of a GT virus insertion in the intron, present in the genomic sequence, between nucleotides 32 and 33 of SEQ ID NO:125. The coding region of the zebrafish TCP-1 Eta nucleic acid sequence spans nucleotides 31 to 1668 of SEQ ID NO:125. The zebrafish TCP-1 Eta gene is 79% identical to the Mus musculus homologue (GenBank Accession No. BC008255.1) over a region spanning 1584 nucleotides, and is 78% identical to the Homo sapiens homologue (GenBank Accession No. XM_(—)002345.4) over a region spanning 1585 nucleotides, of SEQ ID NO:125. In addition, the zebrafish TCP-1 Eta polypeptide is 87% identical and 94% similar to the Mus musculus homologue (GenBank Accession Nos. NP_(—)031664.1, P80313, S43058, CAA83274.1, BAA81878.1, and AAH08255.1) over a region spanning 541 amino acids, and is 88% identical and 95% similar to the Homo sapiens homologue (GenBank Accession Nos. NP_(—)006420.1, XP_(—)002345.2, Q99832, and AAC96011.1) over a region spanning 532 amino acids, of SEQ ID NO:126. Zebrafish mutant for this gene have small heads and eyes.

[0748] The Translation Elongation Factor eEF1 Alpha Gene

[0749] We isolated zebrafish mutants containing an insertion of a GT virus in an intron, present in the genomic sequence, between nucleotides 60 and 61 of the Translation Elongation Factor eEF1 Alpha gene (SEQ ID NO:127). The coding region of the zebrafish 1257 gene spans nucleotides 196 to 1326 of SEQ ID NO:127. In addition, the sequence of SEQ ID NO:127 is identical to that of GenBank Accession No. L23807.1, except for the addition of fourteen nucleotides at the 5′end of the sequence.

[0750] Zebrafish mutant for this gene have a head and eyes that are at least 33% smaller than those of a comparably aged wild-type zebrafish by day three of development. In addition, these zebrafish mutants display increased apoptosis in the head and eyes, but not in the neural tube, between 48 and 72 hours into development.

[0751] The 1257 Gene

[0752] We isolated zebrafish mutants containing an insertion of a GT virus approximately at nucleotide 175 of the zebrafish 1257 nucleic acid sequence (SEQ ID NO: 129). The coding region of this zebrafish nucleic acid sequence spans nucleotides 196 to 1326 of SEQ ID NO:129. In addition, the polypeptide encoded by the zebrafish 1257 nucleic acid sequence is 49% identical and 67% similar to the Homo sapiens AK027570 protein (GenBank Accession Nos. BAB55206.1 and XP_(—)002467.3) over a region spanning 371 or 372 amino acids, respectively, of SEQ ID NO:130.

[0753] By day four of development, zebrafish mutant for the 1257 gene have a head and eyes that are at least 25% smaller than those of a four day-old wild-type zebrafish. In addition, these mutant zebrafish have an underdeveloped jaw.

[0754] The 60S Ribosomal Protein L24 Gene

[0755] We isolated zebrafish mutants containing a viral insertion approximately between nucleotides 144 and 145 of the 60S Ribosomal Protein L24 nucleic acid sequence (SEQ ID NO:131). The zebrafish 60S Ribosomal Protein L24 gene is 82% identical to the Gillichthys mirabilis homologue (GenBank Accession No. AF266175) over a region spanning 344 nucleotides, and is 78% identical to the Homo sapiens homologue (GenBank Accession No. NM_(—)000986.1) over a region spanning 363 nucleotides, of SEQ ID NO:131. In addition, the zebrafish 60S Ribosomal Protein L24 is 89% identical and 94% similar to the Homo sapiens (GenBank Accession Nos. NP_(—)000977.1, XP_(—)015463.1, XP_(—)040555.1, P38663, JN0549, AAC28251.1, and AAH00690.1), Rattus norvegicus (GenBank Accession Nos. NP_(—)071960.1, JC2444, and CAA55203.1), Bos taurus (GenBank Accession No. AAC16388.1), and Mus musculus (GenBank Accession Nos. AAH02110.1 and BAB31374.1) homologues over a region spanning 157 amino acids of SEQ ID NO:132.

[0756] By day three of development, zebrafish mutant for this nucleic acid sequence have a head and eyes that are at least 33% smaller than those of identically aged wild-type zebrafish.

[0757] The Non-Muscle Adenylosuccinate Synthase Gene

[0758] We isolated zebrafish mutants (alleles hi 1433 and hi 3081) containing a viral insertion in the Non-Muscle Adenylosuccinate Synthase nucleic acid sequence. The hi 1433 allele is the result of a GT virus insertion in an intron, present in the genomic sequence, between nucleotides 217 and 218, and the hi 3081 allele is the result of a GT virus insertion approximately at nucleotide 209, of SEQ ID NO:133. The coding region of the Non-Muscle Adenylosuccinate Synthase nucleic acid sequence spans nucleotides 16 to 1399 of SEQ ID NO:133. In addition, the zebrafish Non-Muscle Adenylosuccinate Synthase nucleic acid sequence is 76% identical to the Mus musculus Adenylosuccinate Synthase 1, Muscle (Adssl) nucleic acid sequence (GenBank Accession No. NM_(—)007421.1) over a region spanning 333 nucleotides, and is 74% identical to the Homo sapiens ADSS nucleic acid sequence (GenBank Accession No. NM_(—)001126.1) over a region spanning 476 nucleotides, of SEQ ID NO:133. Furthermore, the zebrafish Non-Muscle Adenylosuccinate Synthase polypeptide is 76% identical and 88% similar to Mus musculus (GenBank Accession Nos. BAB23635.1 and BAB26805.1) and Homo sapiens (GenBank Accession No. S21166) homologues over a region spanning 433 and 429 amino acids, respectively, of SEQ ID NO:134.

[0759] By day two of development, zebrafish mutant for this gene have a head and eyes that are at least 50% smaller than those of two day-old wild-type zebrafish. In addition, at this time in development, these mutants have some apoptotic cells and lack jaws and branchial arches.

[0760] The Nuclear Cap Binding Protein Subunit 2 Gene

[0761] We isolated zebrafish mutants containing an insertion of a GT virus in an intron, present in the genomic sequence, between nucleotides 137 and 138 of the Nuclear Cap Binding Protein Subunit 2 nucleic acid sequence (SEQ ID NO:135). The coding region of the zebrafish Nuclear Cap Binding Protein Subunit 2 nucleic acid sequence includes nucleotides 80 to 526 of SEQ ID NO:135. In addition, the zebrafish Nuclear Cap Binding Protein Subunit 2 gene is 74% identical to the Xenopus laevis homologue (GenBank Accession No. X84788) over a region spanning 390 nucleotides, and is 72% identical to the Homo sapiens homologue (GenBank Accession No. NM_(—)007362.1) over a region spanning 420 nucleotides, of SEQ ID NO:135. Furthermore, the zebrafish Nuclear Cap Binding Protein Subunit 2 is 85% identical and 92% similar to the Xenopus laevis homologue (GenBank Accession Nos. P52299, 151602, CAA59259.1, and 2118330B), and is 81% identical and 92% similar to the Homo sapiens homologue (GenBank Accession Nos. XP_(—)003131.3, XP_(—)028278.1, XP_(—)028279.1, P52298, 137222, CAA58962.1, AAH01255.1, and 2118330A) over a region spanning 143 amino acids of SEQ ID NO:136.

[0762] Zebrafish mutant for this nucleic acid sequence have some transient necrosis in the CNS between 24 and 48 hours of development, resulting in a head and eyes that are at least 25% smaller than those of identically aged wild-type zebrafish by day four of development. In addition, these mutant zebrafish have an underdeveloped jaw and lack branchial arches three and four. Furthermore, the stomach is also underdeveloped by day five of development.

[0763] The Ornithine Decarboxylase Gene

[0764] We isolated zebrafish mutants containing an insertion of a GT virus, present in an intron in the genomic sequence, between nucleotides 97 and 98 of the Ornithine Decarboxylase nucleic acid sequence (SEQ ID NO:137). The coding region of the zebrafish Ornithine Decarboxylase nucleic acid sequence spans nucleotides 264 to 1646 of SEQ ID NO:137. This nucleic acid sequence is identical to that of GenBank Accession No. AF290981, except for the addition of 101 nucleotides at the 5′ end. In addition, the amino acid sequence also is provided as SEQ ID NO:138.

[0765] By day three of development, zebrafish mutant for this nucleic acid sequence have heads and eyes that are at least 33% smaller than those of identically aged wild-type zebrafish. In addition, these mutants also have underdeveloped jaws and branchial arches, relative to wild-type zebrafish.

[0766] The Protein Phosphatase 1 Nuclear Tareting Subunit (PNUTS) Gene

[0767] We isolated zebrafish mutants containing an insertion of an F5 virus approximately at nucleotide 303 of the Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS) nucleic acid sequence (SEQ ID NO:139). The coding region of the zebrafish Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS) nucleic acid sequence includes nucleotides 322 to 2364 of SEQ ID NO:139. One skilled in the art of molecular biology would be able to identify any additional coding sequence using the sequence provided herein.

[0768] In addition, the Zebrafish Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS) nucleic acid sequence is 76% identical to the Rattus norvegicus Protein Phosphatase 1 Nuclear Targeting Subunit nucleic acid sequence (GenBank Accession No. NM_(—)022951.1) over a region spanning 233 nucleotides, and is 74% identical to Homo sapiens Protein Phosphatase 1 Regulatory Subunit 10 nucleic acid sequence (GenBank Accession No. NM_(—)002714.1) over a region spanning 240 nucleotides, of SEQ ID NO:139. Furthermore, the zebrafish Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS) polypeptide is 56% identical and 70% similar to the Rattus norvegicus Protein Phosphatase 1 Nuclear Targeting Subunit (GenBank Accession Nos. NP_(—)075240.1 and AAB96775.1) over a region spanning 636 amino acids, and is 55% identical and 68% similar to the Homo sapiens Protein Phosphatase 1 Regulatory Subunit 10 over a region spanning 637 amino acids, of SEQ ID NO:140.

[0769] By day three of development, zebrafish mutant for the PNUTS gene have slightly smaller heads. In addition, these mutants curve ventrally, have a slightly compressed jaw, and an underdeveloped gut by day five of development.

[0770] The Mitochondrial Inner Membrane Translocating Protein (rTIM23) Gene

[0771] We isolated zebrafish mutants containing an insertion of a GT virus approximately at nucleotide 100 of the Mitochondrial Inner Membrane Translocating Protein (rTIM23) nucleic acid sequence (SEQ ID NO:141). The coding region of this zebrafish nucleic acid sequence spans nucleotides 252 to 875 of SEQ ID NO:141. In addition, the zebrafish Mitochondrial Inner Membrane Translocating Protein (rTIM23) nucleic acid sequence is 76% identical to the Rattus norvegicus homologue (GenBank Accession No. NM_(—)019352.1) over a region spanning 416 nucleotides of SEQ ID NO:141. Furthermore, the zebrafish Mitochondrial Inner Membrane Translocating Protein (rTIM23) is 79% identical and 89% similar to the Rattus norvegicus homologue (GenBank Accession Nos. NP_(—)062225.1, JE0154, and BAA21819.1) over a region spanning 190 amino acids of SEQ ID NO:142.

[0772] Zebrafish mutant for this gene have lighter eyes than wild-type zebrafish and a circulatory defect in the tail blood vessel on day three of development. In addition, most of these mutant zebrafish are dying by day four of development.

[0773] The 1447 Gene

[0774] We isolated zebrafish mutants containing an insertion of a GT virus in an intron, present in the genomic sequence, between nucleotides 227 and 228 of the 1447 nucleic acid sequence (SEQ ID NO:143). The coding region of the zebrafish 1447 nucleic acid sequence spans nucleotides 102 to 2687 of SEQ ID NO:143. The zebrafish 1447 nucleic acid sequence is 76% identical to the human sequence that is similar to Riken cDNA clone 2410015A15 (GenBank Accession No. BC005848.1) over a region spanning 910 nucleotides of SEQ ID NO:143. Furthermore, the zebrafish 1447 polypeptide is 59% identical and 73% similar to the Homo sapiens sequence (GenBank Accession No. AAH05848.1) which is similar to the 2410015A15 sequence over a region spanning 738 amino acids of SEQ ID NO:144. The 1447 amino acid sequence also has a DEAD-box motif at amino acids 91 to 288, and a helicase C motif at amino acids 334 to 415, of SEQ ID NO:144, indicating that the zebrafish protein is likely to be an RNA helicase.

[0775] By day three of development, zebrafish mutant for this gene have an abnormal jaw, as well as a head and eyes that are at least 25% smaller than those of three day-old wild-type zebrafish. In addition, by day five of development, we observe that these mutants either lack or have severely reduced mandibular and branchial arches. Furthermore, at this time in development, these zebrafish mutant for the 1447 nucleic acid have shorter fins, an underdeveloped stomach, lack a pancreas, and have bent tails.

[0776] The ARS2 Gene

[0777] We isolated zebrafish mutants (alleles hi 591 and 2765) containing a viral insertion in the ARS2 (Arsenate Resistance Protein 2) nucleic acid sequence. The hi 591 allele is the result of an F5 virus, and the 2765 allele is the result of a GT virus, inserted in an intron, present in the genomic sequence, between nucleotides 103 and 104 of SEQ ID NO:145. The coding region of the zebrafish ARS2 nucleic acid sequence spans nucleotides 141 to 2828 of SEQ ID NO:145. In addition, the zebrafish ARS2 nucleic acid sequence is 75% identical to the Homo sapiens ARS2 nucleic acid sequence (GenBank Accession No. XM_(—)005015.4) over a region spanning 614 nucleotides of SEQ ID NO:145. In addition, the zebrafish ARS2 polypeptide sequence is 69% identical and 79% similar to that of the human homologue (GenBank Accession No. AAK21005.1) over a region spanning 917 amino acids of SEQ ID NO:146.

[0778] Zebrafish mutant for the ARS2 gene have necrosis in the tectum and an underdeveloped jaw by day three of development. In addition, by day five of development, these mutants have flecks of pigment in the otoliths and widespread edema.

[0779] The Sec61 Alpha Gene

[0780] We isolated zebrafish mutants (alleles hi 1058 and hi 2839B) containing a viral insertion in the Sec61 Alpha nucleic acid sequence (GenBank Accession Nos. AY029527 (nucleotide) and AAK40295 (amino acid)). Both the hi 1058 and the hi 2839B alleles are the result of GT virus insertions in an intron, present in the genomic sequence, between nucleotides 132 and 133 of SEQ ID NO:147. The coding region of the zebrafish Sec61 Alpha nucleic acid sequence spans nucleotides 126 to 1556 of SEQ ID NO:147 and the amino acid sequence is provided in SEQ ID NO:148.

[0781] Zebrafish mutant for the Sec61 Alpha nucleic acid sequence have widespread defects by day three of development, including a bent body, a head and eyes that are at least 25% smaller than those of identically aged wild-type zebrafish, and no development of the jaw or branchial arches.

[0782] The BAF53a Gene

[0783] We isolated zebrafish mutants containing an insertion of a GT virus approximately at nucleotide 160 of the BAF53a nucleic acid sequence (SEQ ID NO:149). The coding region of this zebrafish nucleic acid sequence spans nucleotides 110 to 1396 of SEQ ID NO:149. In addition, the Zebrafish BAF53a nucleic acid sequence is 77% identical to the human homologue (GenBank Accession No. XM_(—)011050.2) over a region spanning 1288 nucleotides of SEQ ID NO:149. The amino acid sequence encoded by the zebrafish BAF53a nucleic acid sequence is 88% identical and 94% similar to the Mus musculus (GenBank Accession No. AAH01994.1) and Homo sapiens (GenBank Accession Nos. NP_(—)004292.1, XP_(—)011050.2, XP_(—)37377.1, XP_(—)37379.1, XP_(—)37376.1, XP_(—)37378.1, AAC94991.1, BAA74577.1, and AAH01391.1) homologues over a region spanning 429 amino acids of SEQ ID NO:150.

[0784] Zebrafish mutant for this nucleic acid sequence have a body that is curved sideways, a small, underdeveloped eye, and an enlarged ventricle with edema by day one of development.

[0785] The Histone Deacetylase Gene

[0786] We isolated zebrafish mutants (alleles hi 1618 and hi 2628) containing a viral insertion in a Histone Deacetylase nucleic acid sequence. The hi 1618 allele is the result of a GT virus insertion approximately at nucleotide 88 of SEQ ID NO:151, and the hi 2628 allele is the result of a GT virus insertion in an intron, present in the genomic sequence, between nucleotides 98 and 99 of SEQ ID NO:151. The coding region of this zebrafish Histone Deacetylase nucleic acid sequence spans nucleotides 46 to 1485 of SEQ ID NO:151. In addition, the zebrafish Histone Deacetylase gene is 81% identical to the Xenopus laevis Deacetylase (RPD3) gene (GenBank Accession No. AF020658.1) over a region spanning 1406 nucleotides, is 79% identical to the Homo sapiens Histone Deacetylase 1 gene (GenBank Accession No. BC000301.1) over a region spanning 1245 nucleotides, and is 77% identical to the Homo sapiens Histone Deacetylase 2 gene (GenBank Accession No. XM_(—)004370.4) over a region spanning 1253 nucleotides, of SEQ ID NO:151. Furthermore, the amino acid sequence encoded by this zebrafish histone deacetylase nucleic acid sequence is 90% identical and 96% similar to a likely Xenopus laevis Histone Deacetylase 1-2 (GenBank Accession No. 042227) over a region spanning 481 amino acids, is 87% identical and 92% similar to Homo sapiens (GenBank Accession No. XP_(—)004370.4) or Mus musculus (GenBank Accession Nos. NP_(—)032255.1 and P70288) Histone Deacetylase 2 over a region spanning 483 amino acids, of SEQ ID NO:152.

[0787] Zebrafish mutant for this Histone Deacetylase nucleic acid sequence have multiple developmental defects. By 48 hours into development, zebrafish mutant for this gene have an enlarged heart, with the atrium being at least twice the size of that of identically aged wild-type embryos. In addition, the eyes are at least 33% smaller than those of wild-type zebrafish and the ears lack semicircular canals and have otoliths that are either very close together or fused. Furthermore, by day three of development, the fin buds are not growing and the jaws and branchial arches are not visible. Finally, by day five of development, Alcian blue fails to stain any cartilage corresponding to the pectoral fins, jaw or branchial arches, or the neurocranium.

[0788] The Fibroblast Isoform of the ADP/ATP Carrier Protein Gene

[0789] We isolated zebrafish mutants containing an insertion of an F5 virus in an intron, present in the genomic sequence, between nucleotides 178 and 179 of the fibroblast isoform of the Fibroblast Isoform of the ADP/ATP Carrier Protein nucleic acid sequence (SEQ ID NO:153). The coding region of this zebrafish nucleic acid sequence spans nucleotides 68 to 961 of SEQ ID NO:153. The zebrafish Fibroblast Isoform of the ADP/ATP Carrier Protein gene is 83% identical to the Xenopus laevis Adenine Nucleotide Translocase (Ant1) GenBank Accession No. AF231347) over a region spanning 886 nucleotides, and is 82% identical to the Homo sapiens Solute Carrier Family 25 Adenine Nucleotide Translocator 5 (GenBank Accession No. NM_(—)001152.1) over a region spanning 870 nucleotides, of SEQ ID NO:153. Furthermore, the amino acid sequence encoded by this zebrafish nucleic acid sequence is 93% identical and 96% similar to the Xenopus laevis Adenine Nucleotide Translocase (GenBank Accession No. AAF63471.1) sequence over a region spanning 298 amino acids, and is 93% identical and 97% similar to the Homo sapiens Solute Carrier Family 25 Adenine Nucleotide Translocator 5 (GenBank Accession Nos. NP_(—)001143.1, P05141, AAA51737.1, and AAB39266.1) sequence over a region spanning 296 amino acids, of SEQ ID NO:154.

[0790] Zebrafish mutant for the Fibroblast Isoform of the ADP/ATP Carrier Protein nucleic acid sequence fail to inflate their swim bladder, but have no other readily apparent phenotypes.

[0791] The TAFII-55 Gene

[0792] We isolated zebrafish mutants containing an insertion of a GT virus in an intron, present in the genomic sequence, between nucleotides 107 and 108 of the TAFII-55 nucleic acid sequence (SEQ ID NO:155). The zebrafish TAFII-55 nucleic acid sequence is 75% identical to the Mus musculus TATA Box Binding Protein Associated Factor (GenBank Accession No. NM_(—)011901.1) spanning 553 nucleotides, and is 74% identical to the Homo sapiens TATA Box Binding Protein Associated Factor (GenBank Accession No. XM_(—)003757.2) over a region spanning 559 nucleotides, of SEQ ID NO:155. In addition, the amino acid sequence encoded by the zebrafish TAFII-55 nucleic acid sequence is 71% identical and 83% similar to the sequence of the Homo sapiens TATA Box Binding Protein Associated Factor (GenBank Accession Nos. XP_(—)003757.1, NP_(—)005633.2, XP_(—)049114.1, Q15545, CAA66636.1, and AAK30585.1), and is 68% identical and 81% similar to the sequence of the Mus musculus TATA Box Binding Protein Associated Factor (GenBank Accession Nos. NP_(—)036031.1 and AAD46767.1), over a region spanning 336 amino acids of SEQ ID NO:156.

[0793] Zebrafish mutant for the TAFII-55 nucleic acid sequence fail to inflate their swim bladder. In addition, by day five of development, these mutants have heads that are approximately 20% smaller, and have eyes that are approximately 33% smaller, than those of identically aged wild-type zebrafish.

[0794] The above experiments were carried out using the following materials and methods.

Materials and Methods

[0795] Zebrafish Strain Maintenance

[0796] Zebrafish were raised and maintained as described previously (Culp et al., Proc. Natl. Acad. Sci. USA 88:7953-7957, 1991), with the following exceptions. Synchronized eggs for injection were obtained by placing four females and two males (which had been separated the night before) in a 4 liter mating chamber for 10-15 min. Pair matings for raising F1 and F2 fish and for screening F3 embryos were performed in 1 liter mating chambers as described by Mullins et al. (Curr. Biol. 4:189-202, 1994). Paramecia fed to fry were counted and delivered in measured amounts three times a day; a total of approximately 400 paramecia per fry per day were required between days 5 and 7 and 800 paramecia per fry per day between days 8 and 11 to allow fish to reach a size at which they could eat brine shrimp.

[0797] Founder fish were generated from embryos from either of two lethal-free lines that were obtained as follows: We crossed outbred fish originally from the laboratory of Dr. Christianne Nüsslein-Volhard (Tübingen, Germany), but carried in our laboratory for approximately 6 years with AB*, a line selected by Charlene Walker (University of Oregon) as highly suitable for use in haploid and early pressure screens. We raised families from each of 15 pair matings. Sibling matings within each family were performed to identify families with no embryonic lethal mutations. Two lines, designated TAB-5 and TAB-14, were identified (no embryonic mutations seen in 18 matings from TAB-5 or in 22 matings from TAB-14) and used to obtain embryos for virus injections.

[0798] Virus Preparation and Injection

[0799] A packaging cell line 293 gp/bsr (Miyoshi et al., Proc. Natl. Acad. Sci. USA 94:10319-10323, 1997), grown in Dulbecco's modified Eagle medium supplemented with fetal calf serum, penicillin, streptomycin, and fungisome, was infected with SFGnlslacZ virus (Gaiano et al., Proc. Natl. Acad. Sci. USA 93:7777-7782, 1996) at three multiplicities of infection (M.O.I.s), 0.05, 0.5, and 5. Four days later, cells were trypsinized and stained with the vital stain fluorescein di-β-D-galactopyranoside (FDG), and passed through a cell sorter. Moderate and highly fluorescent cell populations from each of the three cell populations were selected, grown for 1 week, and then cloned. A total of 46 clones were screened to identify the one capable of producing the highest titer of virus following calcium phosphate-mediated transfection with the plasmid pHCMV-G (Yee et al., Proc. Natl. Acad. Sci. USA 91:9564-9568, 1994), which encodes the envelope protein of vesicular stomatitis virus. The medium was changed 24 hr after transfection, and collections of supernatant were made at 48, 72, 96, and 120 hours. The titer of virus harvested at these time points was measured using mouse 3T3 cells. Titers ranged from 0 to 5.4×10⁶ CFU/ml. The 10 best lines were selected and viral supernatants were titered on a fish cell line, PAC2. The F5 line, derived from an infection at a multiplicity of infection (M.O.I.) of 5, and harboring a single proviral genome, was selected and used to produce F5 virus for the experiment.

[0800] Large quantities of virus stocks were prepared by calcium phosphate transfection of F5 cells that had been seeded 1 day earlier on fifteen 15 cm tissue culture plates treated with 0.01% poly-L-lysine. A total of 50 μg of pHCMV-G DNA per plate was used in the transfection. Media were changed at 24 hr post-transfection, collected at 48, 72, and 96-hr post-transfection were filter sterilized (0.2 pm filter) and concentrated by centrifugation at 21,000 rpm with an SW28 rotor for 1.5 hr at 4° C. (Burns et al., Proc. Natl. Acad. Sci. USA 90:8033-8037, 1993). Viral pellets were resuspended in 30 μl of PBS, the titer determined, and either used fresh or frozen at −80° C. for future use. We estimate that approximately 250,000 embryos were injected on approximately 230 days over a period of 12 months.

[0801] Embryo Assay

[0802] To determine whether viral stocks had high titers on embryos and to ensure that founder fish that were raised were efficiently infected, we determined the proviral DNA content of several injected embryos from every batch injected using an assay designated the embryo assay.

[0803] For the bulk of the project, this assay was performed by quantitative Southern analysis. Ten injected embryos were lysed as five pools of two at 3-5 days of age in 100 mM Tris (pH 8.3), 200 mM NaCl, 5 mM EDTA, 0.4% SDS, 100 pg/ml proteinase K and lysed overnight. DNA was precipitated with ethanol, resuspended, and digested with PvuII, which cuts several times in the viral sequence. The samples were then subjected to electrophoresis through 0.8% agarose and Southern blots were performed along with a reference control from a fish with one proviral insert. The Southern blots were then hybridized with probes to the provirus and to the zebrafish RAG2 gene. The amount of DNA present in each band recognized by a probe was determined using a Molecular Dynamics Phosphoimager, the virus/RAG2 ratio calculated and normalized to the internal reference of 1. Subsequently, the embryo assay was performed using real time quantitative PCR. Single embryos were lysed at 2 days of age and processed as described below for the fin clips.

[0804] Identification of F1 Fish Carrying Multiple Proviral Insertions

[0805] We raised 30 fish per F1 family. To identify fish with at least three unique proviral inserts, we proceeded as follows. At 8-10 weeks of age, fish were anesthetized and placed on a small piece of PARAFILM® flexible thermoplastic film, and the end of their caudal fins were amputated with a scalpel and placed in wells of a 96-well plate. The fish were stored in disposable 16-oz (473 ml) cups while the fin clips were processed. DNA was extracted by incubation in 50 pl of ELVIS lysis buffer (50 mM KCI, 10 mM Tris at pH 8.5, 0.01% gelatin, 0.45% NP-40, 0.45% Tween-20, 5 mM EDTA, 200 pg/ml proteinase K) for at least 2 hr at 55° C. The Proteinase K was inactivated by placing the samples at 96° C. for 15 minutes.

[0806] Approximately 1 pl from each sample served as template for real time quantitative PCR with a Perkin-Elmer 7700 Sequence Detector (Heid et al., Genome Res. 6:986-994, 1996). Primers and the probe used to amplify viral sequences are as follows: SFG F, 5′-CGCTGGAAAGGACCTTACACA-3′; SFG R, 5′-TGCGATGCCGTCTACTTTGA-3′, and SFG probe, 5′-FAM-CTGCTGACCACCCCCACCGC-TAMRA-3′. A separate primer/probe combination was utilized for an internal reference amplicon to amplify the RAGI locus (RAG F, 5′-ATTGGAGAAGTCTACCAGAAGCCTAA-3′; RAG R, 5′-CTTAGTTGCTTGTCCAGGGTTGA-3′, RAG probe, 5′JOE-GCGCAACGGCGGC-GCTC-TAMRA-3′). The SFG primers and RAG primers were used at final concentrations of 74 and 150 nM, respectively, whereas both RAG and SFG probes were used at 200 nM. Each reaction was carried out in a final volume of 12.5 pl with Perkin-Elmer Master Mix. The cycling profiles were 2 minutes at 50° C., 10 minutes at 95° C., and 30×(15 seconds at 95° C., 1 minute at 60° C.). Each 96-well run contained six wells of a reference control from a fish with six inserts. At the end of each run, the RAG and SFG Cts (threshold cycle; the cycle at which the amount of product passed a certain threshold in the linear amplification range) were calculated for each sample and a ACt value was defined by subtracting the SFG Ct from the RAG Ct. The larger the ACt, value the greater the number of viral insets for any given sample. By subtracting the average six-insert fish ACt from each sample's ACt, we calculate the AACt, which can then be used in the following formula to estimate the number of inserts per fish: n=6×2AAct.

[0807] The eight fish with the highest ACt values from each F1 family were further analyzed by Southern blot to allow selection of fish with the greatest number of unique inserts among this group. The remaining fin clip sample for these fish was digested with BglII, which cuts only once in the provirus, and subjected to electrophoresis through 0.8% agarose for approximately 1200 volt-hours before being subjected to Southern analysis. The Southern Blots were then incubated with probes that yield one band per insert. Only fish with at least three unique inserts were kept and used to generate F2 families.

[0808] Identification of Mutagenic Insertions and Gene Cloning

[0809] We identified the mutagenic inserts as follows. We performed Southern blots using DNA from individual embryos having the desired phenotype (the entire sample was used) and from tails of adults which mated (approximately 20% of the DNA sample was used). For the Southern blot, the DNA was digested using BglII and probed with Probe 1 (see FIG. 3). We then identified the band common to all embryos having the desired phenotype and which is uniquely homozygosed in pairs of fish which gave the phenotype. The size of the band was then established to estimate the distance to the 5′ genomic BglII site. We then reprobed the blot with Probe 2 (see FIG. 3) to determine the distance to the 3′ genomic BglII site. If the 3′ and 5′ genomic BglII sites were easily amplified by inverse PCR and there were few other inserts present whose amplification product is smaller than, or close to, the desired product in size, we performed inverse PCR using the appropriate primers (see FIG. 3) as follows: for the 5′ side, primer 1 plus primer 3 (BglII), or primer 5 (NcoI or BamH1); for the 3′ side, primer 7 plus primer 2 (PstI), primer 4 (BglII), or primer 6 (NcoI). We then isolated the appropriately sized PCR products.

[0810] If inverse PCR could not easily be used to amplify the 3′ and 5′ genomic BglII sites, we mapped additional potential inverse PCR sites using further Southern analysis on DNA obtained from several tails now established as carriers or non-carriers of the desired insertion. This DNA was digested with NcoI, NcoI plus BspHI, and BamHI to probe with Probe 1. In addition, the DNA was digested with NcoI, NcoI plus BspHI, PstI, and PstI plus NsiI to probe with Probe 2. While BspHI and NsiI do not cut in the provirus, their restriction products are compatible for ligation with NcoI- and PstI-cut DNA, respectively.

[0811] Furthermore, if the additional Southern analysis did not yield sites that can be used for inverse PCR, we cut the tails of additional F2 fish for more Southern analysis with the enzymes to be used for inverse PCR. However, if an appropriate sample still could not be found, we outcrossed the fish with the fewest additional inserts and raised the offspring to five days of age and lysed each individually in 96 well plates. We then performed Southern analysis with half of the sample to identify samples with only one or few enough other inserts to be used for inverse PCR. Inverse PCR was carried out as described in Allende et al. (Genes & Dev. 10:3141-3155, 1996).

[0812] After inverse PCR, the sequences of the bands were compared with the public database with BLAST (Altschul et al., J. Mol. Biol. 215:403-410, 1990), and when significant homologies were found (usually to zebrafish Expressed Sequence Tags (ESTs)), expression of those genes were analyzed in mutant and wild-type embryos by Northern analysis, RT-PCR, and/or in situ hybridization as described in Allende et al. (Genes & Dev. 10:3141-3155, 1996).

[0813] Generation of Zebrafish Mutants

[0814] 1. Preparation of High Titer Stocks of F5 Virus

[0815] We used two viruses to generate founder fish, F5 and GT. A cell line producing the high titer F5 virus was prepared as follows. We obtained packaging cell line, 293 gp/bsr (Miyoshi et al., Proc. Natl. Acad. Sci. USA 94:10319-10323, 1997), infected it with a virus, SFGnlslacZ (Gaiano et al., Proc. Natl. Acad. Sci. USA 93:7777-7782, 1996), and selected a clone of cells designated F5 that yielded virus with high titer on both mouse 3T3 cells and a fish cell line PAC2 (Culp, Ph.D. Thesis, Massachusetts Institute of Technology, Cambridge, Mass. 1994), as determined by lacZ staining. Virus stocks were prepared by calcium phosphate transfection (Graham and van der Eb, Virology 52:456-467, 1973). During the course of the work, we found that lacZ titering of viruses on PAC2 cells was unreproducible, so we developed an assay to titer viruses on injected embryos.

[0816] 2. Injecting Virus: Monitoring Successful Injections by the Embryo Assay

[0817] To assess the efficiency with which injected embryos are infected, we used either quantitative Southern blotting or quantitative PCR. Two to five days after every injection session, several injected embryos were lysed and their DNA extracted for analysis. Two genomic sequences were probed, a single-locus gene RAG2 (Willett et al., Immunogenetics 45:394-404, 1997) and proviral sequences. The ratio of these signals was normalized to signals from DNA of a fish heterozygous for a single insertion. The result, designated the embryo assay value, was used as a measure of the average number of proviral integrations per cell. Injected eggs that were raised were assumed to have the same embryo assay value as those that were sampled from the same injected batch.

[0818] To determine that the embryo assay was a good predictor of efficient germ-line transmission of proviral insertions, founder fish from batches of injected embryos with a range of embryo assay values were tested to determine the amount of provirus they could transmit to their F1. We outcrossed the founders and used the quantitative assay for RAG2 versus proviral sequences on DNA extracted from pools of their F1 progeny. Although there was considerable variation between founders from injections that had yielded the same embryo assay value, there was a definite correlation between embryo assay and average provirus transmission rate. Most founders from injections with embryo assays below 2 did not transmit well enough for our purposes, about half the founders with embryo assays of 2-5 transmitted sufficiently well, and nearly all founders with embryo assays over 5 transmitted well with an average of greater than one insert per gamete. At this transmission rate, we found that a substantial proportion of F1 fish inherit multiple proviral insertions. With F5 virus, we kept batches of embryos from injections in which the embryo assay values ranged from 2 to 11.4. Injections to make 36,000 founder fish, of which 15,000 were made with F5 virus, were performed 5 days/week for 11 months by one to two injectors per day. We estimate that 250,000 embryos were injected, and hence the overall survival of injected embryos to adulthood was approximately 15%.

[0819] 3. Generating F1 Families, Selecting Multi-Insert F1 Fish, and Identifying Dominant Visible Mutations

[0820] To generate F1 families, we mate founder fish to each other. There is considerable variation in the number of inserts between fish in a single F1 family, as well as between F1 families. To identify fish with the most non-overlapping inserts, 30 F1 fish from each cross are raised for 8 to 12 weeks and then their tail fins are clipped (Westerfield, The zebrafish book University of Oregon Press, Eugene, Oreg., 1995). The fish are held in individual cups while DNA is extracted from the fin clips. A small amount of DNA is analyzed by real time quantitative PCR (Heid et al., Genome Res. 6:986-994, 1996) to identify the eight fish with the greatest number of inserts in each family, and the rest of the sample from these eight fish is used for Southern blot analysis. As shown in FIG. 2, the F1 fish with the greatest number of inserts are often derived from the same germ cell(s) and hence share proviral insertions.

[0821] 4. Generating F2families, Screening F3 Embryos, and Demonstrating that Mutants are Caused by Proviral Insertions

[0822] To generate F2 families, multi-insert F1 fish are mated and 50-70 embryos from each pair are raised. We perform sibling crosses of F2 fish at 3 months of age or older and examine their F3 embryos in a dissecting microscope to identify mutants. We examine embryos at 24 and 48 hr after fertilization and at 5 days of age (approximately 120 hr after fertilization). At day 5, embryos are screened for swimming behavior, then anesthetized, and visible structures are examined for defects.

[0823] To identify which of the insertions (of a total of approximately 10) segregating in an F2 family is linked to an identified mutation, Southern analysis is performed on DNA extracted from fin clips of parents of all the crosses screened in the family. A specific insert (Southern band) must be shared by both parents of every cross that showed the phenotype, and must be in either only one, or neither of the parents of all crosses that did not show the phenotype. We also perform Southern analysis on DNA from individual mutant embryos to look for the presence of this band. An unlinked band would only be present in three-fourths of the embryos, whereas a linked band must be in all of them. Often one can tell from the relative intensity of the bands that the candidate band is homozygous in mutant embryos. However, to obtain stronger evidence of tight linkage, we use a probe to genomic DNA flanking the candidate band.

[0824] Once a candidate band is identified, a junction fragment from either or both sides of this insertion is cloned by inverse PCR (Ochman et al., Genetics 120:621-623, 1988). The strategy used to clone the correct junction fragment from families with many inserts is shown in FIG. 3. The junction fragment is then used for two purposes: First, to distinguish between chromosomes with and without the putative mutagenic insertion on Southern blots, thereby determining whether mutant embryos are invariably homozygous for the mutagenic insert and verifying that their wild-type siblings never are; and second, to see if the junction fragment, when sequenced, has homology to a known gene or EST in the public database.

[0825] For recessive mutants, heterozygotes were crossed, embryos were sorted as having a mutant phenotypic or as being wild-type, and DNA was extracted from individual embryos and analyzed by Southern analysis for genotyping. For dominant mutants, heterozygotes were crossed to wild-type fish, juvenile fish were sorted by phenotype, and DNA was extracted from fin clips and analyzed by Southern analysis or PCR.

Genes and Disease

[0826] The invention provides methods for the diagnosis and treatment of a variety of diseases or disorders, including human diseases and disorders. In addition, the invention also provides methods for the identification of therapeutic compounds for the treatment of these diseases or disorders.

[0827] Proliferative Disorders

[0828] Zebrafish with proliferative disorders, for example, those containing a mutation in a 904, Pescadillo, 1055-1, Valyl-tRNA Synthase, Translation Elongation Factor eEF1 Alpha, 821-02, Cyclin A2, Kinesin-Related Motor Protein EG5, 459, or 299 nucleic acid or amino acid sequence may be useful for the diagnosis, treatment, and/or identification of therapeutic compounds in cancer. One example of a zebrafish mutant with a proliferative disorder is the 904 mutant. 904 mutants display an overgrowth of neuronal tissues. Many cancers arise due to defects in the regulation of the cell cycle. Mutants, such as Pescadillo, 1055-1, or Cyclin A2, contain viral insertions in cell cycle checkpoint genes that are also of interest in the diagnosis and treatment of cancer or neuroblastoma. Alternatively, mutations that result in increased apoptosis may identify drug targets that are useful for the treatment of proliferative disorders, such potentially useful mutations may be in a Kinesin-Related Motor Protein EG5, 459, Valyl-tRNA Synthase, Translation Elongation Factor eEF1 Alpha, 821-02, or 299 nucleic acid sequence. In addition, 904, Pescadillo, 1055-1, Cyclin A2, Kinesin-Related Motor Protein EG5, 459, and 299 nucleic acid or amino acid sequences may be used in methods to diagnose, prevent, and treat proliferative disorders using the methods described herein.

[0829] Bone Connective Tissue, or Cartilage Formation Disorders

[0830] Zebrafish with defects in bone, connective tissue, or cartilage development, for example, those containing a mutation in a 954, Histone Deacetylase, 215, 307, 572, 1116A, 1548, Casein Kinase 1α, smoothened, 299, TCP-1 Beta, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, Ornithine Decarboxylase, 1447, and 994 nucleic acid or amino acid sequence may be useful for the diagnosis or treatment of a cartilage, connective tissue, or bone-related disorder. In particular, such potentially useful mutants include those with viral insertions in the 954 and Histone Deacetylase genes. Zebrafish containing a mutation in a 954 nucleic acid sequence lack dTDP-glucose 4-6-dehydratases, and have cartilage that fails to stain with Alcian blue. A histone deacetylase mutant also fails to stain with Alcian blue, and lacks many cartilaginous structure such as jaws, neurocranium, and fins. The 215 mutant has defects in jaw and ceratohyal formation. The 307, 572, 1116A, 1548, Casein Kinase 1α, Smoothened, 299 and 994 mutants have defects in the formation of cartilaginous structures. In addition, 954, histone deacetylase, 215, 307, 572, 1116A, 1548, Casein kinase 1α, smoothened, 299, TCP-1 Beta, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, Ornithine Decarboxylase, 1447, and 994 nucleic acid or amino acid sequences may be used in methods to diagnose, prevent, and treat defects in bone, connective tissue, or cartilage development using the methods described herein. Cartilage or connective tissue disorders that may be diagnosed or treated using the nucleotides or polypeptides described herein include, for example, arthritis, rheumatoid arthritis, juvenile rheumatoid arthritis, Osteogenesis Imperfecta, or Marfan syndrome. Bone diseases or disorders that may be diagnosed or treated using the nucleotides or polypeptides described herein include, for example, Paget's disease, fibrous dysplasia, osteochondritis dissecans, Saethre-Chotzen syndrome, or osteoporosis

[0831] Cell Death Disorders

[0832] Cell death, for example, apoptosis, plays a role in a number of neurodegenerative disorders. There is mounting evidence supporting an apoptosis-necrosis cell death continuum. In this continuum, neuronal death can result from varying contributions of coexisting apoptotic and necrotic mechanisms (Martin, Int. J Mol. Med. 7:455-78, 2001). A number of zebrafish mutants, such as U2AF, Ribonucleotide Reductase R1 Class 1, Kinesin-Related Motor Protein EG5, 459, 299, 1373, Denticleless, Ribonucleotide Reductase Protein R2, Telomeric Repeat Factor 2, SIL, U1 Small Nuclear Ribonucleoprotein C, Ski Interacting Protein, 297, Small Nuclear Ribonucleoprotein D1 Protein, or DNA Polymerase Epsilon Subunit B, 1045, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, 1581, DEAD-Box RNA helicase (DEAD5 or DEAD19), cyclin A2, ISWI/SNF2, XCAP-C, DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Nuclear Cap Binding Protein Subunit 2, Ornithine Decarboxylase, ARS2 and Protein Phosphatase 1 Nuclear Targeting Subunit, Valyl-tRNA Synthase, Translation Elongation Factor eEF1 Alpha, Non-Muscle Adenylosuccinate Synthase, and 821-02 mutants display either necrotic or apoptotic cell death. Accordingly, these nucleic acid and amino acid sequences and zebrafish with defects in any of these nucleic acid or amino acid sequences may be used in the diagnosis, prevention, or treatment of cell death disorders using the methods described herein. Zebrafish mutants that show excess apoptotic cell death, such as Kinesin-Related Motor Protein EG5, 459, Valyl-tRNA Synthase, Translation Elongation Factor eEF1 Alpha, 821-02, and 299 are of particular interest. Cell death disorders that may be treated using the polypeptides and polynucleotides described herein include, for example, neurodegenerative disorders characterized by excess cell death, such as Alzheimer's disease, Parkinson's disease, spinocerebellar ataxia, or Huntington's disease. Necrotic disorders that may be treated using the polypeptides and polynucleotides described herein include, for example, Leigh's disease or subacute necrotizing encephalomyelopathy

[0833] Circulatory Defects

[0834] Multiple cell populations are involved in the morphogenesis of the cardiac, arterial, and venous systems as well as in the correct alignment and connection of the developing vessels within the cardiac chambers. The development of this intricate cell network is subject to a high rate of congenital abnormalities. Zebrafish with defects in the formation of the circulatory system, for example, those containing a mutation in a 904, 429, 1463, or Mitochondrial Inner Membrane Translocating (rTIM23) nucleic acid or amino acid sequence may be useful in the diagnosis or treatment of diseases or disorders of the circulatory system. In addition, 904, 429, 1463, and Mitochondrial Inner Membrane Translocating (rTIM23) nucleic acid and amino acid sequences may be used to diagnose, treat, or prevent circulatory disorders using the methods described herein. Circulatory, disorders that may be treated using the polypeptides and polynucleotides described herein include, for example, atherosclerosis, stroke, thrombosis, peripheral arterial disease, hypertension, hypotension, or peripheral vascular disease.

[0835] Craniofacial Defects

[0836] Molecular genetic studies have shown that mutations in the genes governing bone morphogenesis signaling networks cause a variety of human craniofacial defects. Such birth defects may be prevented or diagnosed if their etiology is understood. Zebrafish mutants with viral insertions in the following genes may be useful in the diagnosis, treatment, or prevention of such craniofacial birth defects: Nrp-1, 215, 307, 572, 1116A, 1548, Casein Kinase 1 α, Smoothened, 299, 297, DNA Replication Licensing Factor MCM2, 1257, TCP-1 Alpha, TCP-1 Beta, Nuclear Cap Binding Protein Subunit 2, Non-Muscle Adenylosuccinate Synthase, Ornithine Decarboxylase, and Sec61 Alpha. In addition, Nrp-1, 215, 307, 572, 1116A, 1548, casein kinase 1 α, Smoothened, 299, 297, DNA Replication Licensing Factor MCM2, 272, 1257, TCP-1 Alpha, TCP-1 Beta, Nuclear Cap Binding Protein Subunit 2, Non-Muscle Adenylosuccinate Synthase, Ornithine Decarboxylase, and Sec61 Alpha nucleic acid and amino acid molecules may also be used to diagnose, treat, or prevent craniofacial malformations using the methods described herein. Craniofacial disorders that may be treated, diagnosed, or prevented using the polypeptides and polynucleotides described herein include, for example, Apert, Crouzon, Pfeiffer and Saethre-Chotzen Syndromes, Treacher-Collins syndrome, Marfan's syndrome, or Angelman's disease, or an eye malformation syndrome, such as oculorenal syndrome, or Reiger syndrome.

[0837] Hearing Disorders

[0838] It is estimated that at least 1-1.5/1000 live births result in congenital permanent hearing impairment. Congenital hearing loss has profound effects on the speech, language, and social development of children. Many adults also suffer from varying degrees of aquired hearing loss. Mutants with viral insertions in the following genes have defects in otolith, semi-circular canal, or hair cell development: POU2, HNF1-β/vHNF1, U1 Small Nuclear Ribonucleoprotein C, Cyclin A2, ARS2, TCP-1 Eta, or Histone Deaceytlase. Accordingly, zebrafish mutant for a POU2, HNF1-β/vHNF1, U1 Small Nuclear Ribonucleoprotein C, Cyclin A2, ARS2, TCP-1 Eta or Histone Deaceytlase nucleic acid or amino acid sequence, as well as these nucleic acid and amino acid molecules themselves, may be useful for the diagnosis, prevention, or treatment of either congenital or acquired hearing disorders using the methods described herein. Disorders that may be treated, diagnosed, or prevented using the polypeptides and polynucleotides described herein include, for example, congenital deafness or a sensory disorder, such as Usher syndrome or Waardenburg syndrome, or acquired hearing loss resulting from Paget's disease or osteogenesis imperfecta.

[0839] Diabetes

[0840] Diabetes is a growing problem throughout the world. Identifying the underlying genetic defects that lead to diabetes may be useful for the prevention, diagnosis and/or treatment of this disease. Genes such as HNF1-β/vHNF1, 429, and 1447 are required for normal pancreas development in zebrafish. HNF1-β/vHNF1 mutants display an abnormal pancreas. Interestingly, HNF1-β/vHNF1 is homologous to human Hepatic Transcription Factor 1; mutations in human HNF1 cause a genetic form of human diabetes, MODY V (maturity onset diabetes of the young), in which patients have kidney defects in addition to diabetes (Iwasaki et al., Diabetes Care 21:2144-2148, 1998; Horikawa et al., Nat. Genet. 17:384-385, 1997). Accordingly, zebrafish containing a mutant HNF1-β/vHNF1, 429, or 1447 nucleic acid or amino acid sequence may be useful for identifying compounds that may be used to diagnose, treat, or prevent diabetes in humans. In addition, the HNF1-β/vHNF1, 429, and 1447 nucleic acid and amino acid sequences themselves may be used in diagnosing, preventing, or treating diabetes using the methods described herein. Pancreatic disorders that may be treated, diagnosed, or prevented using the polypeptides and polynucleotides described herein include, for example, juvenile diabetes, type I diabetes, type II diabetes, diabetes insipidus, or gestational diabetes.

[0841] Heart Defects and Disorders

[0842] Congenital heart defects are one of the most common form of birth defects. Nearly 1/125-1/150 children born each year have a congenital heart defect. Moreover, congenital heart defects are one of the most common causes of infant mortality. Identifying the underlying genetic defects that result in congenital heart defects may be useful for the prevention, diagnosis and/or treatment of congenital heart defects. Zebrafish with viral insertions in the 1548, Mitochondrial Inner Membrane Translocating Protein (rTIM23), or TCP-1 Alpha, histone deaceytlase, and Nuclear Cap Binding Protein Subunit 2 nucleic acid or amino acid sequences display heart defects. Accordingly, such zebrafish may be used to identify test compounds that may be used to diagnose, prevent, or treat human heart defects. In addition, 1548, Mitochondrial Inner Membrane Translocating Protein (rTIM23), or TCP-1 Alpha, histone deaceytlase, and Nuclear Cap Binding Protein Subunit 2 nucleic acid and amino acid molecules themselves may be useful in diagnosing, preventing, or treating human heart defects using the methods described herein. Heart defects, diseases, or disorders that may be treated using the polypeptides and polynucleotides described herein include, for example, congenital heart defects, such as ventricular or atrial septal defects, or heart disease, such as heart attack, congestive heart failure, or coronary artery disease.

[0843] Infertility

[0844] Various genetic defects can lead to infertility. Early embryonic death can be caused by genetic defects including defects in the spinster nucleic acid sequence. Accordingly, zebrafish mutant for a spinster nucleic acid or amino acid sequence may be used to identify test compounds that are useful in the diagnosis, prevention, or treatment of human infertility. In addition, spinster nucleic acid or amino acid molecules themselves may be used to diagnose, prevent, or treat human infertility using the methods described herein.

[0845] Kidney Disorders

[0846] Congenital kidney defects are a common birth defect. In fact, in 2002, more than 8 million Americans had seriously reduced kidney function and about 400,000 required dialysis or a kidney transplant to stay alive. This figure has doubled in the last ten years. One common form of kidney disorders involves cystic kidney disorders which are prevalent among children born with developmental defects in kidney formation. Cystic kidney disorders are also prevalent in adults. Cystic kidney disorders involve the formation of fluid-filled sacs in the kidney. These cysts tend to develop in weak segments of the tubules that carry urine from the glomeruli. The cyst's growth displaces healthy tissue. In polycystic kidney disease, cysts generally occur in both the left and the right kidney. The polycystic kidney often retains its shape despite the presence of multiple cysts. These cysts, however, may impair the kidney's function. In contrast, multicystic kidney disease usually affects only one of the two kidneys. The affected kidney loses its characteristic bean shape and, instead, resembles a cluster of grapes. Moreover, the affected kidney does not function.

[0847] Zebrafish with viral insertions in a 459 or HNF1-β/vHNF1 nucleic acid sequence may have kidney abnormalities, including, for example, a cystic kidney phenotype. Accordingly, zebrafish having a mutation in a 459 or HNF1-β/vHNF1 nucleic acid or amino acid sequence may be used in screen to identify compounds that are useful in diagnosing, preventing, or treating kidney disease. In addition, 459 or HNF1-β/vHNF1 nucleic acid and amino acid sequences themselves may be using in methods of diagnosing, preventing, or treating kidney disease, as described herein. Kidney defects, diseases, or disorders that may be treated using the polypeptides and polynucleotides described herein include, for example, polycystic kidney disease, multicystic kidney disease, malformation of the kidney, Bardet-Biedl syndrome, kidney failure, acute renal failure, nephrolithiasis, congenital nephritic syndrome, kidney infection, or kidney stones.

[0848] Limb Formation Defects

[0849] Defects in limb formation are another common birth defect. Zebrafish with viral insertions in genes such as Casein Kinase 1 α, Smoothened, and Histone Deacetylase have fin abnormalities that may be analogous to human limb formation abnormalities. Accordingly, zebrafish having a mutation in a Casein Kinase 1 α, Smoothened, and Histone Deacetylase nucleic acid or amino acid sequence may be used to identify test compounds that affect limb formation. In addition, Casein Kinase 1 α, Smoothened, and Histone Deacetylase nucleic acid and amino acid sequences themselves may be used in methods for diagnosing, preventing, or treating limb abnormalities. Limb formation defects, diseases, or disorders that may be treated using the polypeptides and polynucleotides described herein include, for example, such as achondroplasia, Ellis-van Creveld syndrome, Poland sequence, or Pfeiffer syndrome.

[0850] Mental Retardation and Mental Diseases

[0851] Zebrafish with defects in brain development may be used in methods of identifying test compounds that affect brain development. Accordingly, these mutant zebrafish, for example, zebrafish having a mutation in a Telomeric Repeat Factor 2, U1 Small Nuclear Ribonucleoprotein C, DNA polymerase Epsilon Subunit B, 1045, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, DEAD-Box RNA Helicase (DEAD5 or DEAD19), Chromosomal Assembly Protein (XCAP-C), DNA Replication Licensing Factor MCM2, ARS2, 40S ribosomal protein S18, Histone Deacetylase, Protein Phosphatase 1 Nuclear Targeting Subunit protein (PNUTS), Ornithine Decarboxylase, 1447, U2AF, 1463, 1262, VPSP18, 994, 60S ribosomal L35, 60S ribosomal L44, Smoothened, SIL, Ski Interacting Protein, 297, Small Nuclear Ribonucleoprotein D1, 1581, Cyclin A2, ISWI/SNF, DNA Replication Licensing Factor MCM3, Valyl-tRNA Synthase, 40S ribosomal S5, TCP-1 Alpha, TCP-1 Beta, Translation Elongation Factor eEF1 Alpha, 1257, 60S Ribosomal Protein L24, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2 Polypeptide, Sec61 Alpha, BAF53a, or TAFII-55 nucleic acid or amino acid sequence may be useful in diagnosing, preventing, or treating mental retardation. In addition, Telomeric Repeat Factor 2, U1 Small Nuclear Ribonucleoprotein C, DNA polymerase Epsilon Subunit B. 1045, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, DEAD-Box RNA Helicase (DEAD5 or DEAD19), Chromosomal Assembly Protein (XCAP-C), DNA Replication Licensing Factor MCM2, ARS2, 40S ribosomal protein S18, Histone Deacetylase, Protein Phosphatase 1 Nuclear Targeting Subunit protein (PNUTS), Ornithine Decarboxylase, 1447, U2AF, 1463, 1262, VPSP18, 994, 60S ribosomal L35, 60S ribosomal L44, Smoothened, SIL, Ski Interacting Protein, 297, Small Nuclear Ribonucleoprotein D1, 1581, Cyclin A2, ISWI/SNF, DNA Replication Licensing Factor MCM3, Valyl-tRNA Synthase, 40S ribosomal S5, TCP-1 Alpha, TCP-1 Beta, Translation Elongation Factor eEF1 Alpha, 1257, 60S Ribosomal Protein L24, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2 Polypeptide, Sec61 Alpha, BAF53a, or TAFII-55 nucleic acid and amino acid sequences themselves may be used in methods of diagnosing, preventing, and treating mental retardation. Mental defects, diseases, or disorders that may be treated using the polypeptides and polynucleotides described herein include, for example, Down's syndrome, Chiari Malformation, colpocephaly, holoprosencephaly, lissencephaly, or megalencephaly, microencephaly, anencephaly, autism, schizophrenia, depression, dementia, or bipolar disorders.

[0852] Muscle Defects

[0853] Defects in muscle development or function are another prevalent class of congenital birth defects. We determined that the zebrafish genes 428, Smoothened, Glypican-6 or Knypek, and Denticleless are required for proper muscle development. Accordingly, zebrafish mutant for a 428, Smoothened, Glypican-6 or Knypek, or Denticleless nucleic acid or amino acid sequence may be used in screens for idenifying compounds that affect muscle development. In addition, 428, Smoothened, Glypican-6 or Knypek, and Denticleless nucleic acid and amino acid sequence themselves may be used in methods for diagnosing, preventing, or treating congenital muscle disorders. Muscle defects, diseases, or disorders that may be treated using the polypeptides and polynucleotides described herein include, for example, myotonia congenital, inclusion body myositis, or a congenital muscular dystrophy.

[0854] Neurodegenerative Disorders

[0855] As people are living longer in developed countries, a quiet epidemic of dementia is emerging. By age 85, one of every three people is demented and the vast majority of these people suffer from neurodegenerative illnesses. In the United States alone, it is estimated that nearly 5 million people suffer from dementia. Understanding the causes of neurodegenerative disorders is extremely important to developing effective therapies. Zebrafish having a mutation in a Aryl Hydrocarbon Receptor Nuclear Translocator 2A, Vesicular Integral-Membrane Protein VIP 36, 297, 40S Ribosomal Protein S5, Cad-1, V-ATPase16 kDa Proteolytic Subunit, 459, 299, Ribonucleotide Reductase Protein R1 Class 1, Kinesin Related Motor Protein EG5, 821-02, Valyl-tRNA Synthase, or Translation Elongation Factor eEFI Alpha nucleic acid or amino acid sequence display neurodegenerative phenoypes. For example, many human patients with motor neuron disease are touch insensitive in addition to displaying impaired movement. The Cad-1 mutants are an example of a zebrafish mutant that displays a similar phenotype. These mutants are touch insensitive and lose motility. Accordingly, these zebrafish may be useful in screens to identify test compounds that may be used in diagnosing, preventing, or treating a neurodegenerative disorder. In addition, Aryl Hydrocarbon Receptor Nuclear Translocator 2A, Vesicular Integral-Membrane Protein VIP 36, 297, 40S Ribosomal Protein S5, Cad-1, V-ATPase 16 kDa Proteolytic Subunit, 459, 299, Ribonucleotide Reductase Protein R1 Class 1, Kinesin Related Motor Protein EG5, 821-02, Valyl-tRNA Synthase, or Translation Elongation Factor eEF1 Alpha nucleic acid or amino acid sequences themselves may be used in methods for diagnosing, preventing, or treating a neurodegenerative disorder, as described herein. Neurodegenerative defects, diseases, or disorders that may be treated using the polypeptides and polynucleotides described herein include, for example, Huntington's disease, Parkinson's disease, amyotrophic lateral sclerosis, or cerebellar ataxias.

[0856] Retinal Disorders

[0857] Retinal degenerative diseases, including retinitis pigmentosa and age-related macular degeneration, are a common cause of a visual defect. Understanding the genetic defects that underlie various retinal degenerative diseases may lead to improved diagnosis, prevention, and/or therapies. Zebrafish having a mutation in a CopZ1, VPSP18, Ribonucleotide Reductase R1 Class 1, Telomeric Repeat Factor 2, SIL, or ISWI/SNF2 nucleic acid or amino acid sequence display retinal degeneration. Accordingly, such mutant zebrafish may be used in screens to identify test compounds that affect retinal degeneration. In addition, CopZ1 nucleic acid and amino acid sequences may be used in methods to diagnose, prevent, or treat retinal degeneration, as described herein. Retinal defects, diseases, or disorders that may be treated using the polypeptides and polynucleotides described herein include, for example, retinitis pigmentosa, macular degeneration, Friedreich's ataxia, or Laurence-Moon syndrome, and pigmentation disorders, such as human pigmentary glaucoma, Wagner's disease, Cockayne's syndrome, Kearns-Sayre disease, Waardenburg syndrome, and Usher's syndrome.

[0858] Stroke

[0859] Stroke is a leading cause of death, killing approximately 160,000 Americans each year and is also the number one cause of adult disability. Given the prevalence of stroke, it is important to identify genetic mutations that may be used to assess genetic predisposition to stroke, as well as to identify drug targets for stroke therapies. Based on the phenotype of zebrafish having a mutation in a 1463, 904, or Mitochondrial Inner Membrane Translocating Protein (rTIM23) gene, and based on the homology of the Mitochondrial Inner Membrane Translocating (rTIM23) protein to LimpII, which interacts with an inhibitor of angiogenesis, 1463, 904, or Mitochondrial Inner Membrane Translocating Protein (rTIM23) nucleic acid and amino acid molecules may function in the development of vasculature in the brain and may be an important therapeutic target for stroke therapy. Accordingly, zebrafish mutant for a 1463, 904, or Mitochondrial Inner Membrane Translocating Protein (rTIM23) nucleic acid and amino acid molecule may be used in screens to identify compounds that may be used to diagnose, prevent, or treat stroke or other bleeding disorders in the brain. In addition, 1463, 904, or Mitochondrial Inner Membrane Translocating Protein (rTIM23) nucleic acid and amino acid molecules themselves may also be used to diagnose, prevent, or treat such disorders using the methods described herein.

[0860] Stem Cell Generation

[0861] The identification of genes that function in stem cell generation is important to organ regeneration. Mutants with defects in stem cell populations lack entire organs. Zebrafish with a mutation in a 1447 nucleic acid or amino acid sequence have no pancreas. Accordingly, such mutant zebrafish may be used in methods of identifying test compounds that affect organ regeneration. In addition, 1447 nucleic acid or amino acid molecules themselves may be used in methods of organ regeneration as described herein.

[0862] Pigmentation

[0863] Pigmentary glaucoma is a significant cause of human visual defects. In this disease, abnormally liberated iris pigment and cell debris enter the ocular drainage structures, leading to increased intraocular pressure (IOP) and glaucoma. In addition, abnormal pigment production and mutant melanosomal protein genes may contribute to human pigmentary glaucoma. Accordingly, therapeutic strategies designed to decrease pigment production may be beneficial in preventing or treating human pigmentary glaucoma. Therefore, nucleic acids and polypeptides that, when altered, result in pigmentation defects may be used in the diagnosis or treatment of visual defects, for instance, blindness due to human pigmentary glaucoma. Zebrafish that have mutations in a V-ATPase SFD Subunit, V-ATPase 16 kDa Proteolytic Subunit, CopZ1, V-ATPase Alpha Subunit, 1463, VPSP18, 297, Valyl-tRNA Synthase, Mitochondrial Inner Membrane Translocating Protein (rTIM23), or ARS2 nucleic acid or amino acid sequence have pigmentation defects. Consequently, such mutant zebrafish may be used in methods of identifying test compounds that may be useful in the diagnosis, prevention, or treatment of visual defects. In addition, V-ATPase SFD Subunit, V-ATPase16 kDa Proteolytic Subunit, CopZ1, V-ATPase Alpha Subunit, 1463, VPSP18, 297, Valyl-tRNA Synthase, Mitochondrial Inner Membrane Translocating Protein (rTIM23), or ARS2 nucleic acid and amino acid sequences themselves may be used in methods to treat visual defects, as described herein. Pigmentation defects, diseases, or disorders that may be prevented, treated, or diagnosed using the polypeptides and polynucleotides described herein include, for example, human pigmentary glaucoma, Wagner's disease, Cockayne's syndrome, Kearns-Sayre disease, Waardenburg syndrome, Usher's syndrome, or retinitis pigmentosa.

[0864] Pulmonary Disorders

[0865] The zebrafish swim bladder is thought to be a primitive lung. Consequently, zebrafish, for example, zebrafish having a mutation in a Fibroblast Isoform of the ADP/ATP Carrier Protein or TAFII-55 nucleic acid or amino acid sequence, with defective swim bladders may be used in screens to identify test compounds that may be useful in the diagnosis, prevention, or treatment of pulmonary disorders. In addition, Fibroblast Isoform of the ADP/ATP Carrier Protein and TAFII-55 nucleic acid and amino acid molecules may be used in the diagnosis, prevention, or treatment of pulmonary disorders using methods described herein. Pulmonary defects, diseases, or disorders that may be treated using the polypeptides and polynucleotides described herein include, for example, asthma or chronic bronchitis.

[0866] Movement Disorders

[0867] Movement defects of varying etiologies are prevalent in both children and adults. Understanding the etiology of the various movement disorders may lead to improved therapies. Zebrafish having a mutation in a Cad-1, Aryl Hydrocarbon Receptor Nuclear Translocator 2A, or 40S Ribosomal Protein S5, Neurogenin Related Protein 1, SIL, U1 Small Nuclear Ribonucleoprotein C, Ski Interacting Protein nucleic acid or amino acid sequence display motility defects. Accordingly, such mutant zebrafish may be used in methods to identify test compounds that affect motility. In addition, Cad-1, Aryl Hydrocarbon Receptor Nuclear Translocator 2A, or 40S Ribosomal Protein S5, Neurogenin Related Protein 1, SIL, U1 Small Nuclear Ribonucleoprotein C, or Ski Interacting Protein nucleic acid or amino acid sequences may be used in methods to diagnose, prevent, or treat human movement disorders. Movement defects, diseases, or disorders that may be treated using the polypeptides and polynucleotides described herein include, for example, Angelman's syndrome, tardive dyskinesia, spasticity, ataxias, or spinocerebellar ataxia; also neurodegenerative disorders, for example, Huntington's disease, or Parkinson's disease, Alzheimer's disease.

[0868] Somite Formation

[0869] Congenital myopathies are developmental disorders of muscle. These congenital defects include segmental amyoplasia, generalized amyoplasia, and congenital muscle fiber-type disproportion. Neonatal myotonic dystrophy is a maturational delay in muscle development. Zebrafish having a mutation in a 428, Smoothened, Glypican-6 or Knypek, or Denticleless, or 60S Ribosomal L35 nucleic acid or amino acid sequence exhibit abnormal somite formation. Accordingly, such zebrafish mutants may be used in methods of identifying test compounds that may be used to diagnose, prevent, or treat congenital muscle disorders. In addition, 428, 60S Ribosomal Protein L35, Smoothened, Glypican-6 or knypek, and Denticleless nucleic acid and amino acid sequences may be used in methods to diagnose, prevent, or treat congenital muscle disorders, as described herein. Muscle defects, diseases, or disorders that may be treated using the polypeptides and polynucleotides described herein include, for example, myotonia congenital, inclusion body myositis, muscular atrophy, or a congenital muscular dystrophy.

Treatment

[0870] The nucleic acids and polypeptides of the present invention are useful for treating a variety of diseases and disorders, including proliferative disorders, bone, connective tissue, and cartilage formation disorders, cell death disorders, circulatory defects, craniofacial defects, visual defects, hearing disorders, diabetes, heart defects and diseases, kidney defects, limb formation defects, mental retardation, muscle defects, neurodegenerative disorders, retinal degeneration, stroke, stem cell population defects, pigmentation disorders, movement disorders, and somite formation defects. Nucleic acids and polypeptides of the present invention, for example, a 904, POU2, 40S Ribosomal Protein S18, U2AF, 954, Nrp-1, Cad-1, V-ATPase alpha subunit, V-ATPase SFD subunit, V-ATPase 16 kDa proteolytic subunit, 1463, VPSP18, pescadillo, HNF1-β/vHNF1, 60S Ribosomal Protein L35, 60S Ribosomal Protein L44, CopZ1, 215, 307, 572, 1116A, 1548, Casein Kinase 1 α, Nodal-Related or Squint, Smoothened, 429, 428, Spinster, Glypican-6 or Knypek, Ribonucleotide Reductase Protein R1 Class 1, Kinesin-Related Motor Protein EG5, 459, Wnt5 or Pipetail, Aryl Hydrocarbon Receptor Nuclear Translocator 2A, Vesicular Integral-Membrane Protein VIP 36, 299, 994, 1373, Denticleless, Ribonucleotide Reductase Protein R2, TCP-1 Alpha, Telomeric Repeat Factor 2, SIL, U1 Small Nuclear Ribonucleoprotein C, Ski Interacting Protein, 297, TCP-1 Complex Gamma Chain, Small Nuclear Ribonucleoprotein D1, DNA Polymerase Epsilon Subunit B, 821-02, 1045, 1055-01, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, DEAD-Box RNA Helicase (DEAD5 or DEAD19), 1581, Cyclin A2, ISWI/SNF2, Chromosomal Assembly Protein C (XCAP-C), DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Valyl-tRNA Synthase, 40S Ribosomal Protein S5, TCP-1 Beta, TCP-1 Eta, Translation Elongation Factor eEF1 Alpha, 1257, 60S Ribosomal Protein L24, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, Ornithine Decarboxylase, Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS), Mitochondrial Inner Membrane Translocating Protein (rTIM23), 1447, ARS2, Sec61 Alpha, BAF53a, Histone Deacetylase, Fibroblast Isoform of the ADP/ATP Carrier Protein, or a TAFII-55 nucleic or amino acid sequence, may be administered by any of a variety of routes known to those skilled in the art, such as, for example, by intraperitoneal, subcutaneous, parenteral, intravenous, intramuscular, or subdermal injection. However, the nucleic acids or polypeptides may also be administered as an aerosol, as well as orally, nasally, or topically. Appropriate carriers or diluents for, as well as what is essential for the preparation of a pharmaceutical composition are described, e.g., in Remington's Pharmaceutical Sciences (18^(th) edition), ed. A. Gennaro, 1990, Mack Publishing Company, Easton, Pa., a standard reference book in this field.

[0871] Formulations for parenteral administration may, for example, contain excipients, sterile water, or saline. For inhalation, formulations may contain excipients, for example, lactose. Aqueous solutions may be used for administration in the form of nasal drops, or as a gel for topical administration. The exact dosage used will depend on the severity of the condition or the general health of the patient and the route of administration. One skilled in the art would know how to determine and adjust the dosage as required. These nucleic acids or polypeptides may be administered once, or they may be repeatedly administered as part of a regular treatment regimen over a period of time.

[0872] In addition, the invention provides methods of gene therapy by targeting, for example, a nucleic acid sequence descibed herein. Such a nucleic acid sequence may be introduced into an abnormal cell, for example, by using liposome-based transfection techniques, to treat a disorder (Units 9.1-9.4, Ausubel et al., Current Protocols in Molecular Biology, John Wiley & Sons, New York, 1995). Such DNA constructs may also be introduced into mammalian cells using an adenovirus, or retroviral or vaccinia viral vectors (Units 9.10 and 16.15-16.19, Ausubel et al., Current Protocols in Molecular Biology, John Wiley & Sons, New York, 1995). These standard methods of introducing DNA into cells are applicable to a variety of cell-types.

[0873] For example, recombinant adenoviral vectors offer several significant advantages for gene transfer. The viruses can be prepared at extremely high titer, infect non-replicating cells, and confer high-efficiency and high-level transduction of target cells in vivo after directed injection or perfusion. Either directed injection or perfusion would be appropriate for delivery of vectors containing a therapeutic gene in a clinical setting. When a viral vector is used to administer the nucleic acids of the invention, standard concentrations include, for example, 10², 10³, 10⁴, 10⁵, or 10⁶ plaque forming units (pfu)/animal, in a pharmacologically acceptable carrier.

[0874] In animal models, adenoviral gene transfer has generally been found to mediate high-level expression for approximately one week. The duration of transgene expression may be prolonged, and ectopic expression reduced, by using tissue-specific promoters. Other improvements in the molecular engineering of the adenoviral vector itself have produced more sustained transgene expression and less inflammation. This is seen with so-called “second generation” vectors harboring specific mutations in additional early adenoviral genes and “gutless” vectors in which virtually all the viral genes are deleted utilizing a Cre-Lox strategy (Engelhardt, et al., Proc. Natl. Acad. Sci. USA 91:6196-6200, 1994; Kochanek, et al., Proc. Natl. Acad. Sci. USA 93:5731-5736, 1996).

[0875] In addition, recombinant adeno-associated viruses (rAAV), derived from non-pathogenic parvoviruses, may be used to express a target gene as these vectors evoke almost no cellular immune response, and produce transgene expression lasting months in most systems. Incorporation of a tissue-specific promoter is, again, beneficial. Furthermore, besides adenovirus vectors and rAAVs, other vectors and techniques are known in the art, for example, those described by Wattanapitayakul and Bauer (Biomed. Pharmacother. 54:487-504, 2000), and citations therein.

[0876] A vector carrying a therapeutic gene can be delivered to the target organ through in vivo perfusion by injecting the vector into the target organ, or into blood vessels supplying this organ (e.g., for the liver, the portal vein (Tada, et al., Liver Transpl. Surg. 4:78-88, 1998) could be used, or in the case of leukemia, the blood itself may be the delivery target.

[0877] RNA Interference

[0878] Alternatively, a naturally-occurring nucleic acid sequence corresponding to any of the nucleic acid sequences on the invention may be inactivated using RNA interference (“RNAi”). RNAi is a form of post-transcriptional gene silencing initiated by the introduction of double-stranded RNA (dsRNA) or antisense RNA. RNAi was first characterized in C. elegans, where many expressed genes are subject to inactivation by RNAi (Fire et al., Nature 391:806-11, 1998; Fraser et al., Nature 408:325-30, 2000). This effect has also been observed in a variety of other organisms including Drosophila melanogaster and mammals.

[0879] In particular, short twenty-one to twenty-five nucleotide double-stranded RNAs are effective at down-regulating gene expression in nematodes (Zamore et al., Cell 101: 25-33) and in mammalian tissue culture cell lines (Elbashir et al., Nature 411:494-498, 2001, hereby incorporated by reference). The further therapeutic effectiveness of this approach in mammals was demonstrated in vivo by McCaffrey et al. (Nature 418:38-39, 2002). The nucleic acid sequence of a gene descibed herein can be used to design small interfering RNAs (“siRNAs”) that will inactivate a naturally-occurring gene that contains the specific 21 to 25 nucleotide RNA sequences used.

[0880] Test Compounds

[0881] Compounds that may be tested for the ability to modulate the expression of target genes, or of their gene products, can be from natural as well as synthetic sources. Those skilled in the field or drug discovery and development will understand that the precise source of test extracts or compounds is not critical to the methods of the invention. Examples of such extracts or compounds include, but are not limited to, plant-based, fungal-based, prokaryotic-based, or animal-based extracts, fermentation broths, and synthetic compounds, as well as modification of existing compounds. Numerous methods are also available for generating random or directed synthesis (e.g., semi-synthesis or total synthesis) of any number of chemical compounds, including, but not limited to, saccharide-based, lipid-based, peptide-based, and nucleic acid-based compounds. Synthetic compound libraries are commercially available from Brandon Associates (Merrimack, N.H.) and Aldrich Chemical (Milwaukee, Wis.). Alternatively, libraries of natural compounds in the form of bacterial, fungal, plant, and animal extracts are commercially available from a number of sources, including Biotics (Sussex, UK), Xenova (Slough, UK), Harbor Branch Oceanographics Institute (Ft. Pierce, Fla.), and Pharmamar, U.S.A. (Cambridge, Mass.). In addition, natural and synthetically produced libraries are produced, if desired, according to methods known in the art, e.g., by standard extraction and fractionation methods. Furthermore, if desired, any library or compound is readily modified using standard chemical, physical, or biochemical methods.

[0882] A test compound that modulates the expression of a target gene, or its encoded protein, may be used to treat any of the diseases and disorders described above.

[0883] Diagnosis

[0884] The methods of the present invention can be used to diagnose a disease or disorder, or the propensitiy to develop a disease or disorder.

[0885] A genetic lesion in a candidate gene may be identified in a biological sample obtained from a patient using a variety of methods available to those skilled in the art. Generally, these techniques involve PCR amplification of nucleic acid from the patient sample, followed by identification of the genetic lesion by altered hybridization, aberrant electrophoretic gel migration, restriction fragment length polymorphism (“RFLP”) analysis, binding or cleavage mediated by mismatch binding proteins, or direct nucleic acid sequencing. Any of these techniques may be used to facilitate detection of a genetic lesion in a candidate gene, and each is well known in the art; examples of particular techniques are described, without limitation, in Orita et al. (Proc. Natl. Acad. Sci. USA 86:2766-2770, 1989) and Sheffield et al. (Proc. Natl. Acad. Sci. USA 86:232-236, 1989). Furthermore, expression of the candidate gene in a biological sample (e.g., a biopsy) may be monitored by standard Northern blot analysis or may be aided by PCR (see, e.g., Ausubel et al., Current Protocols in Molecular Biology, John Wiley & Sons, New York, 2000; PCR Technology: Principles and Applications for DNA Amplification, H. A. Ehrlich, Ed., Stockton Press, NY; and Yap et al., Nucl. Acids. Res. 19:4294, 1991).

[0886] Once a genetic lesion is identified using the methods of the invention (as is described above), the genetic lesion is analyzed for association with a particular disease or disorder, for example, a proliferative disorder or an increased risk of developing such a disorder.

[0887] Furthermore, antibodies against a protein produced by the gene included in the genetic lesion, for example, a protein encoded by any of the nucleic acid sequences described herein, may be used to detect altered expression levels of the protein, including a lack of expression, or a change in its mobility on a gel, indicating a change in structure or size. In addition, antibodies may be used for detecting an alteration in the expression pattern or the sub-cellular localization of the protein. Such antibodies include ones that recognize both a wild-type and a mutant protein, as well as ones that are specific for either the wild-type or an altered form of the protein. If desired, monoclonal antibodies may also be prepared using the proteins described above and standard hybridoma technology (see, e.g., Kohler et al., Nature 256:495, 1975); Kohler et al., Eur. J. Immunol. 6:511, 1976); Kohler et al., Eur. J. Immunol. 6:292, 1976); Hammerling et al., In Monoclonal Antibodies and T Cell Hybridomas, Elsevier, New York, N.Y., 1981; Ausubel et al., Current Protocols in Molecular Biology, John Wiley & Sons, New York, 2000)). The specificicity of a monoclonal antibody may be tested by Western blot or immunoprecipitation analysis (by the methods described in, for example, Ausubel et al. (Current Protocols in Molecular Biology, John Wiley & Sons, New York, 2000).

[0888] Antibodies used in the methods of the invention may be produced using amino acid sequences that do not reside within highly conserved regions, and that appear likely to be antigenic, as analyzed by criteria such as those provided by the Peptide Structure Program (Genetics Computer Group Sequence Analysis Package, Program Manual for the GCG Package, Version 7, 1991) using the algorithm of Jameson and Wolf (CABIOS 4:181, 1988). These fragments can be generated by standard techniques, e.g., by the PCR, and cloned into the pGEX expression vector (Ausubel et al., Current Protocols in Molecular Biology, John Wiley & Sons, New York, 2000). GST fusion proteins are expressed in E. coli and purified using a glutathione agarose affinity matrix as described in Ausubel et al. (Current Protocols in Molecular Biology, John Wiley & Sons, New York, 2000).

[0889] Use of Transgenic and Knockout Animals in Diagnosis

[0890] The disclosed transgenic and knock-out animals may be used as research tools to determine genetic and physiological features of a disease or disorder, and for identifying compounds that can affect such diseases or disorders. For example, a zebrafish embryo, for example, a fertilized egg, or a one-day old embryo, that is mutant, either homozygous or heterozygous, for a 904, POU2, 40S Ribosomal Protein S18, U2AF, 954, Nrp-1, Cad-1, V-ATPase Alpha Subunit, V-ATPase SFD Subunit, V-ATPase 16 kDa Proteolytic Subunit, 1463, VPSP18, Pescadillo, HNF1-β/vHNF1, 60S Ribosomal Protein L35, 60S Ribosomal Protein L44, CopZ1, 215, 307, 572, 1116A, 1548, Casein Kinase 1 a, Nodal-Related or Squint, Smoothened, 429, 428, Spinster, Glypican-6 or Knypek, Ribonucleotide Reductase Protein R1 Class 1, Kinesin-Related Motor Protein EG5, 459, Wnt5 or Pipetail, Aryl Hydrocarbon Receptor Nuclear Translocator 2A, Vesicular Integral-Membrane Protein VIP 36, 299, 994, 1373, Denticleless, Ribonucleotide Reductase Protein R2, TCP-1 Alpha, Telomeric Repeat Factor 2, SIL, U1 Small Nuclear Ribonucleoprotein C, Ski Interacting Protein, 297, TCP-1 Complex Gamma Chain, Small Nuclear Ribonucleoprotein D1, DNA Polymerase Epsilon Subunit B, 821-02, 1045, 1055-01, Spliceosome Associated Protein 49, DNA Replication Licensing Factor MCM7, DEAD-Box RNA Helicase (DEAD5 or DEAD19), 1581, Cyclin A2, ISWI/SNF2, Chromosomal Assembly Protein C (XCAP-C), DNA Replication Licensing Factor MCM2, DNA Replication Licensing Factor MCM3, Valyl-tRNA Synthase, 40S Ribosomal Protein S5, TCP-1 Beta, TCP-1 Eta, Translation Elongation Factor eEF1 Alpha, 1257, 60S Ribosomal Protein L24, Non-Muscle Adenylosuccinate Synthase, Nuclear Cap Binding Protein Subunit 2, Ornithine Decarboxylase, Protein Phosphatase 1 Nuclear Targeting Subunit (PNUTS), Mitochondrial Inner Membrane Translocating Protein (rTIM23), 1447, ARS2, Sec61 Alpha, BAF53a, Histone Deacetylase, Fibroblast Isoform of the ADP/ATP Carrier Protein, or a TAFII-55 nucleic acid or amino acid sequence may be contacted with a test compound described herein, followed by observing the development of this embryo. In this manner, test compounds can be identified that decrease or eliminate the developmental abnormalities and such test compounds would serve as candidate compounds for treating a related human disease or disorder.

[0891] Knockout animals also include animals where this normal gene has been inactivated or removed and replaced with a known polymorphic or other mutant allele of this gene. These animals can serve as a model system for the risk of acquiring a disease that is associated with a particular allele. In general, the method of identifying markers associated with a proliferative disorder, involves comparing the presence, absence, or level of expression of genes, either at the RNA level or at the protein level, in tissue from a transgenic or knock-out animal and in tissue from a matching non-transgenic or knock-out animal. Standard techniques for detecting RNA expression, e.g., by Northern blotting, or protein expression, e.g., by Western blotting, are well known in the art. Differences between animals such as the presence, absence, or level of expression of a gene indicate that the expression of the gene is a marker associated with a disorder. Identification of such markers would be useful since they are possible therapeutic targets. Identification of markers can take several forms.

[0892] One method by which molecular markers may be identified is by use of directed screens. Patterns of accumulation of a variety of molecules can be surveyed using immunohistochemical methods. Screens directed at analyzing expression of specific genes or groups of molecules implicated in pathogenesis can be continued during the life of the transgenic or knockout animal. Expression can be monitored by immunohistochemistry as well as by protein and RNA blotting techniques.

[0893] Alternatively, molecular markers may be identified using genomic screens. For example, tissue can be recovered from young transgenic or knockout animals and older transgenic or knockout animals, and compared with similar material recovered from age-matched normal littermate controls to catalog genes that are induced or repressed as disease is initiated, and as disease progresses to its final stages. These surveys will generally include cellular populations present in the affected tissue.

[0894] This analysis can also be extended to include an assessment of the effects of various treatments on differential gene expression (“DGE”). The information derived from the surveys of DGE can ultimately be correlated with disease initiation and progression in the transgenic or knockout animals.

[0895] To assess the effectiveness of a treatment paradigm, a transgene, such as a mutant of any of the nucleic acid sequences described herein, may be conditionally expressed (e.g., in a tetracycline sensitive manner). For example, the promoter for this gene may contain a sequence that is regulated by tetracycline and expression of the gene product ceases when tetracycline is administered to the mouse. In this example, a tetracycline-binding operator, tetO, is regulated by the addition of tetracycline, or an analog thereof, to the organism's water or diet. The tetO may be operably-linked to a coding region, for example, a wild-type or mutant nucleic acid sequence described herein. The system also may include a tetracycline transactivator (“tTA”), which contains a DNA binding domain that is capable of binding the tetO as well as a polypeptide capable of repressing transcription from the tetO (e.g., the tetracycline repressor (“tetR”)), and may be further coupled to a transcriptional activation domain (e.g., VP16). When the tTA binds to the tetO sequences, in the absence of tetracycline, transcription of the target gene is activated. However, binding of tetracycline to the tTA prevents activation. Thus, a gene operably-linked to a tetO is expressed in the absence of tetracycline and is repressed in its presence. Alternatively, this system could be modified such that a gene is expressed in the presence of tetracycline and repressed in its absence. Tetracycline regulatable systems are well known to those skilled in the art and are described in, for example, WO 94/29442, WO 96/40892, WO 96/01313, and Yamamoto et al. (Cell 101:57-66, 2000).

[0896] In addition, the knockout organism may be a conditional, i.e., somatic, knockout. For example, FRT sequences may be introduced into the organism so that they flank the gene of interest. Transient or continuous expression of the FLP protein may then be used to induce site-directed recombination, resulting in the excision of the gene of interest. The use of the FLP/FRT system is well established in the art and is described in, for example, U.S. Pat. No. 5,527,695, and in Lyznik et al. (Nucleic Acid Research 24:3784-3789, 1996).

[0897] Conditional, i.e., somatic knockout organisms may also be produced using the Cre-lox recombination system. Cre is an enzyme that excises DNA between two recognition sites termed loxP. The cre transgene may be under the control of an inducible, developmentally regulated, tissue specific, or cell-type specific promoter. In the presence of Cre, the gene, for example a nucleic acid sequence described herein, flanked by loxP sites is excised, generating a knockout. This system is described, for example, in Kilby et al. (Trends in Genetics 9:413-421, 1993).

[0898] Particularly desirable is a mouse model wherein an altered nucleic acid sequence described herein is expressed in specific cells of the transgenic mouse such that the transgenic mouse develops a disease or disorder. In addition, cell lines from these mice may be established by methods standard in the art.

[0899] Construction of transgenes can be accomplished using any suitable genetic engineering technique, such as those described in Ausubel et al. (Current Protocols in Molecular Biology, John Wiley & Sons, New York, 2000). Many techniques of transgene construction and of expression constructs for transfection or transformation in general are known and may be used for the disclosed constructs.

[0900] One skilled in the art will appreciate that a promoter is chosen that directs expression of the chosen gene in the tissue in which a disease or disorder is expected to develop. For example, as noted above, any promoter that regulates expression of a nucleic acid sequence described herein can be used in the expression constructs of the present invention. One skilled in the art would be aware that the modular nature of transcriptional regulatory elements and the absence of position-dependence of the function of some regulatory elements, such as enhancers, make modifications such as, for example, rearrangements, deletions of some elements or extraneous sequences, and insertion of heterologous elements possible. Numerous techniques are available for dissecting the regulatory elements of genes to determine their location and function. Such information can be used to direct modification of the elements, if desired. It is desirable, however, that an intact region of the transcriptional regulatory elements of a gene is used. Once a suitable transgene construct has been made, any suitable technique for introducing this construct into embryonic cells can be used.

[0901] Animals suitable for transgenic experiments can be obtained from standard commercial sources such as Taconic (Germantown, N.Y.). Many strains are suitable, but Swiss Webster (Taconic) female mice are desirable for embryo retrieval and transfer. B6D2F (Taconic) males can be used for mating and vasectomized Swiss Webster studs can be used to stimulate pseudopregnancy. Vasectomized mice and rats are publicly available from the above-mentioned suppliers. However, one skilled in the art would also know how to make a transgenic mouse or rat. An example of a protocol that can be used to produce a transgenic animal is provided below.

[0902] Production of Transgenic Mice and Rats

[0903] The following is but one desirable means of producing transgenic mice. This general protocol may be modified by those skilled in the art.

[0904] Female mice six weeks of age are induced to superovulate with a 5 IU injection (0.1 cc, IP) of pregnant mare serum gonadotropin (PMSG; Sigma) followed 48 hours later by a 5 IU injection (0.1 cc, IP) of human chorionic gonadotropin (hCG, Sigma). Females are placed together with males immediately after hCG injection. Twenty-one hours after hCG injection, the mated females are sacrificed by CO₂ asphyxiation or cervical dislocation and embryos are recovered from excised oviducts and placed in Dulbecco's phosphate buffered saline with 0.5% bovine serum albumin (BSA, Sigma). Surrounding cumulus cells are removed with hyaluronidase (1 mg/ml). Pronuclear embryos are then washed and placed in Earle's balanced salt solution containing 0.5% BSA (EBSS) in a 37.5° C. incubator with humidified atmosphere at 5% CO₂, 95% air until the time of injection. Embryos can be implanted at the two-cell stage.

[0905] Randomly cycling adult female mice are paired with vasectomized males. Swiss Webster or other comparable strains can be used for this purpose. Recipient females are mated at the same time as donor females. At the time of embryo transfer, the recipient females are anesthetized with an intraperitoneal injection of 0.015 ml of 2.5% avertin per gram of body weight. The oviducts are exposed by a single midline dorsal incision. An incision is then made through the body wall directly over the oviduct. The ovarian bursa is then torn with watchmakers forceps. Embryos to be transferred are placed in DPBS (Dulbecco's phosphate buffered saline) and in the tip of a transfer pipet (about 10 to 12 embryos). The pipet tip is inserted into the infundibulum and the embryos are transferred. After the transferring the embryos, the incision is closed by two sutures.

[0906] A desirable procedure for generating transgenic rats is similar to that described above for mice (Hammer et al., Cell 63:1099-112, 1990). For example, thirty-day old female rats are given a subcutaneous injection of 20 IU of PMSG (0.1 cc) and 48 hours later each female placed with a proven, fertile male. At the same time, 40-80 day old females are placed in cages with vasectomized males. These will provide the foster mothers for embryo transfer. The next morning females are checked for vaginal plugs. Females who have mated with vasectomized males are held aside until the time of transfer. Donor females that have mated are sacrificed (CO₂ asphyxiation) and their oviducts removed, placed in DPBA (Dulbecco's phosphate buffered saline) with 0.5% BSA and the embryos collected. Cumulus cells surrounding the embryos are removed with hyaluronidase (1 mg/ml). The embryos are then washed and placed in EBSs (Earle's balanced salt solution) containing 0.5% BSA in a 37.5° C. incubator until the time of microinjection.

[0907] Once the embryos are injected, the live embryos are moved to DPBS for transfer into foster mothers. The foster mothers are anesthetized with ketamine (40 mg/kg, IP) and xulazine (5 mg/kg, IP). A dorsal midline incision is made through the skin and the ovary and oviduct are exposed by an incision through the muscle layer directly over the ovary. The ovarian bursa is torn, the embryos are picked up into the transfer pipet, and the tip of the transfer pipet is inserted into the infundibulum. Approximately 10 to 12 embryos are transferred into each rat oviduct through the infundibulum. The incision is then closed with sutures, and the foster mothers are housed singly.

[0908] Generation of Knockout Mice

[0909] The following is but one example for the generation of a knockout mouse and the protocol may be readily adapted or modified by those skilled in the art.

[0910] Embryonic stem cells (ES), for example, 10⁷ AB1 cells, may be electroporated with 25 μg targeting construct in 0.9 ml PBS using a Bio-Rad Gene Pulser (500μF, 230 V). The cells may then be plated on one or two 10-cm plates containing a monolayer of irradiated STO feeder cells. Twenty-four hours later, they may be subjected to G418 selection (350 μg/ml, Gibco) for 9 days. Resistant clones may then be analyzed by Southern blotting after Hind III digestion, using a probe specific to the targeting construct. Positive clones are expanded and injected into C57BL/6 blastocysts. Male chimeras may be back-crossed to C57BL/6 females. Heterozygotes may be identified by Southern blotting and intercrossed to generate homozygotes.

[0911] The targeting construct may result in the disruption of the gene of interest, e.g., by insertion of a heterologous sequence containing stop codons, or the construct may be used to replace the wild-type gene with a mutant form of the same gene, e.g., a “knock-in.” Furthermore, the targeting construct may contain a sequence that allows for conditional expression of the gene of interest. For example, a sequence may be inserted into the gene of interest that results in the protein not being expressed in the presence of tetracycline. Such conditional expression of a gene is described in, for example, Yamamoto et al. (Cell 101:57-66, 2000)).

1 159 1 1664 DNA Danio rerio 1 tcatccgtgg gcgttggggg ctgctgttta cagacccgag cgggcggttt gtttcgagca 60 tgaaggactg cgcttttttc accctaccgc cattcattca ggaattatat ctgaactcga 120 gtaaaccaga aggttttaat caaagtgctt cagccggact gtttttgcaa acgcttttcc 180 tgcacagaca ttctgaaccg cgtcgcgcga agttggacca tctgttgaac gtttgcgagc 240 gcaccgcaaa gttctcaacg cgtttatttt ttaacattta atcgcaaaac aactgttgtt 300 gtttcctaaa gcgaacgttt ttgttgcgtt tatgctaacg ccaagcaact ttggagaggt 360 tagctatagc tggttagcct agactcgcta actggcctct gctaaaaggg ccgttagtga 420 cctaaacgta ttgttgttct acatttattc ataacgtgta ataataataa ctacaaagtt 480 tgttttctaa tggtctctgt acgttatatt taactaaaac agttgcattt atgagtcgac 540 aacttgagca gttgtgaaaa gcattagcac tccaaaaaac ttcagttaaa cattttcctt 600 aaggaattaa cgtaagcatt ctctctgttt tgctatacag caagagtgac ctccatcatt 660 ggccatatgt aagacgaacc agtcctttta aaacccaata ttctcaaagc tggctgaacc 720 gccagctaaa agtaacttaa atcgcaactt ttgctcgtaa atgaccaccg gcaggaataa 780 ccgagtgatg atggaaggag tgggtgctcg agttatccgc gggcctgact ggaagtgggg 840 gaagcaggac ggcggagaag ggcatgtggg caccgtcagg agcttcgaga gcccggagga 900 ggtggtggtg gtgtgggata acggcaccgc cgccaattac cgctgctccg gggcttatga 960 tgtgaggata ttagacagtg cgcctacagg aatcaagcat gatggcacca tgtgtgacac 1020 ctgtcggcag cagcccatca ttggaatccg atggaaatgt gcagagtgca ccaactacga 1080 cctctgtacc acctgttacc atggcgacaa gcaccacctg cgccacaggt tttaccgcat 1140 caccacccct ggcagtgaaa gagttcttct ggagtctcgt cgcaagtcga agaagatcac 1200 agcccgcggc atctttgcag gaggacgagt ggtgcgaggg gtggattggc aatgggagga 1260 tcaggatggc ggaaatggaa gaagagggaa ggtgacggag atccaggact ggagtgctgc 1320 cagcccccac agcgcagcct acgtcctttg ggacaacggg gccaagaacc tctaccgagt 1380 gggattcgag ggcatgtctg atttgaaatg tgtgcaagat gccaaaggag ggacttttta 1440 cagagaccac tgtccagtat tgggtaagat ttaaaatatc tgatattacc aaaatatacc 1500 atgccgattc aatgaatata cagtggggga aataagtatt gagcacgtca ccatttttct 1560 cagaaaacaa atttctaaag gtgctgttga cttgactttt caccagatgt tggtaacaac 1620 caaagaaatc catatatgcc aaaaaaaaaa aaaaaaaaaa aaaa 1664 2 237 PRT Danio rerio 2 Met Thr Thr Gly Arg Asn Asn Arg Val Met Met Glu Gly Val Gly Ala 1 5 10 15 Arg Val Ile Arg Gly Pro Asp Trp Lys Trp Gly Lys Gln Asp Gly Gly 20 25 30 Glu Gly His Val Gly Thr Val Arg Ser Phe Glu Ser Pro Glu Glu Val 35 40 45 Val Val Val Trp Asp Asn Gly Thr Ala Ala Asn Tyr Arg Cys Ser Gly 50 55 60 Ala Tyr Asp Val Arg Ile Leu Asp Ser Ala Pro Thr Gly Ile Lys His 65 70 75 80 Asp Gly Thr Met Cys Asp Thr Cys Arg Gln Gln Pro Ile Ile Gly Ile 85 90 95 Arg Trp Lys Cys Ala Glu Cys Thr Asn Tyr Asp Leu Cys Thr Thr Cys 100 105 110 Tyr His Gly Asp Lys His His Leu Arg His Arg Phe Tyr Arg Ile Thr 115 120 125 Thr Pro Gly Ser Glu Arg Val Leu Leu Glu Ser Arg Arg Lys Ser Lys 130 135 140 Lys Ile Thr Ala Arg Gly Ile Phe Ala Gly Gly Arg Val Val Arg Gly 145 150 155 160 Val Asp Trp Gln Trp Glu Asp Gln Asp Gly Gly Asn Gly Arg Arg Gly 165 170 175 Lys Val Thr Glu Ile Gln Asp Trp Ser Ala Ala Ser Pro His Ser Ala 180 185 190 Ala Tyr Val Leu Trp Asp Asn Gly Ala Lys Asn Leu Tyr Arg Val Gly 195 200 205 Phe Glu Gly Met Ser Asp Leu Lys Cys Val Gln Asp Ala Lys Gly Gly 210 215 220 Thr Phe Tyr Arg Asp His Cys Pro Val Leu Gly Lys Ile 225 230 235 3 2361 DNA Danio rerio 3 gattggccca ttctgattga tggccgccac acaaggactg cgcacatttc cacacaggct 60 gatactgaga cgcgcaatct gagtgtagtc tgtttggatc atcctgggat ggagcaactg 120 atcgacgttt attgattatc aaaccagcag tctttacgct ttcttgtttg aaatctcaac 180 aaccttttta gcggaaagat gacggagaga gcgcagagcc caacagcagc agactgcaga 240 ccctatgagg tcaacagggc catgtatcct caagccgcgg gcctggatgg acttggcgga 300 gcgtccttgc agtttgcgca cggtatgctt caggatccaa gtctgatttt taacaaggcc 360 catttcaacg gaatcacccc cgcgacagcc cagaccttct ttccattttc aggcgatttt 420 aaaacgaacg atttgcaagg tggcgacttt acgcagccca aacactggta cccgtttgcg 480 gcccccgagt tcactgggca ggttgcagga gcgactgccg ccactcagcc ggcgaacatc 540 agccctccta tcggcgagac tagagagcaa attaagatgc catctgaggt caaaaccgag 600 aaagatgttg aagaatacgg gaatgaagaa aacaagccgc cgtcacaata tcacctcacc 660 gctggaacat cttccatccc caccggggtg aactactaca cgccatggaa ccctaatttc 720 tggcctggac tgtcccaaat tacggcccaa gctaatattt cccaagctcc cccaactccc 780 tccgcttcat ccccatctct gtctccgtct ccccctggaa atgggttcgg aagcccagga 840 ttttttagcg gaggcaccgc gcaaaacatt ccctcagctc aagcgcaaag tgcaccccgg 900 agcagtgggt cctccagtgg aggatgcagt aattctgagg aagaggagac tctgactact 960 gaagatttgg agcagtttgc gaaagagctc aaacacaagc gcatcactct gggcttcacg 1020 caggcagatg tgggactcgc gcttggaaac ttgtatggca aaatgttcag tcagacgaca 1080 atctgccgct ttgaggctct ccaacttagt ttcaagaaca tgtgcaaact gaagccgctg 1140 ctgcagaggt ggttgaacga ggccgaaaac tccgagaacc ctcaggatat gtacaaaatt 1200 gaacgggtgt ttgtcgacac gcgaaaaaga aaacggagga ccagcttgga aggcacagtc 1260 cgttctgctc tagagtcgta cttcgtgaag tgccccaaac ccaacactct ggagataacg 1320 cacatatcgg atgatctagg cctggagaga gatgtagtgc gtgtatggtt ctgcaaccgt 1380 agacagaagg gaaagcgtct agctttgccc tttgatgacg agtgtgttga agcacagtat 1440 tacgagcaga gtccaccacc tccaccccac atgggtggca ctgtgctccc aggtcaaggc 1500 tatcctggac cagcccatcc tggaggagcc cctgccttat acatgccatc cctccaccga 1560 ccagatgtct tcaaaaacgg ctttcaccct ggtttggtgg gtcatctcaa cagctaactt 1620 gctccaggtg ctcactaaag tttatagaaa agctccaagc caagctatgg atcaaatcag 1680 ccatgcttat aattctttta attttatttt gtaacactgc atctgttcat ttggttcttt 1740 ccatacgtgc cttgagacac ttatgttagt gtctggttga ccatcacagt aggaacctga 1800 tggattattg tgttatgagt gctttgttga actactgcgt ttgcacatga agtcataggc 1860 tcatttttat agttgttttt aacaatgttt gctcatgtat cggtttttat gtacatatat 1920 ttgtcttaaa cgtttgatat ttgatgattt accaaagtat atttccatat ttgtctccat 1980 aactttgtag ttttcccgta tttgtgtgaa ctggcagcaa attcaagact tgatgaaatt 2040 taaatttgac tttctttttc cctttttttt ttttttaatt ttaatgacgt gatggatttt 2100 ttttttatta cgatttggag aacttttatc aaatgatttg acaagaacat tagtcactgt 2160 tctttttcac ctctctaaat gcttgcactc tcccatctcc aagtgaaatg cctctagatg 2220 tactctttat cagggtggtt atacataaat atgctgccag tttgaaagcc tctctgtgga 2280 ggtcatgtgc cttatctttc aatgtctcaa ttcatgcatg ctcacccgca gtaaattaaa 2340 gccatatttg ataatttatt g 2361 4 472 PRT Danio rerio 4 Met Thr Glu Arg Ala Gln Ser Pro Thr Ala Ala Asp Cys Arg Pro Tyr 1 5 10 15 Glu Val Asn Arg Ala Met Tyr Pro Gln Ala Ala Gly Leu Asp Gly Leu 20 25 30 Gly Gly Ala Ser Leu Gln Phe Ala His Gly Met Leu Gln Asp Pro Ser 35 40 45 Leu Ile Phe Asn Lys Ala His Phe Asn Gly Ile Thr Pro Ala Thr Ala 50 55 60 Gln Thr Phe Phe Pro Phe Ser Gly Asp Phe Lys Thr Asn Asp Leu Gln 65 70 75 80 Gly Gly Asp Phe Thr Gln Pro Lys His Trp Tyr Pro Phe Ala Ala Pro 85 90 95 Glu Phe Thr Gly Gln Val Ala Gly Ala Thr Ala Ala Thr Gln Pro Ala 100 105 110 Asn Ile Ser Pro Pro Ile Gly Glu Thr Arg Glu Gln Ile Lys Met Pro 115 120 125 Ser Glu Val Lys Thr Glu Lys Asp Val Glu Glu Tyr Gly Asn Glu Glu 130 135 140 Asn Lys Pro Pro Ser Gln Tyr His Leu Thr Ala Gly Thr Ser Ser Ile 145 150 155 160 Pro Thr Gly Val Asn Tyr Tyr Thr Pro Trp Asn Pro Asn Phe Trp Pro 165 170 175 Gly Leu Ser Gln Ile Thr Ala Gln Ala Asn Ile Ser Gln Ala Pro Pro 180 185 190 Thr Pro Ser Ala Ser Ser Pro Ser Leu Ser Pro Ser Pro Pro Gly Asn 195 200 205 Gly Phe Gly Ser Pro Gly Phe Phe Ser Gly Gly Thr Ala Gln Asn Ile 210 215 220 Pro Ser Ala Gln Ala Gln Ser Ala Pro Arg Ser Ser Gly Ser Ser Ser 225 230 235 240 Gly Gly Cys Ser Asn Ser Glu Glu Glu Glu Thr Leu Thr Thr Glu Asp 245 250 255 Leu Glu Gln Phe Ala Lys Glu Leu Lys His Lys Arg Ile Thr Leu Gly 260 265 270 Phe Thr Gln Ala Asp Val Gly Leu Ala Leu Gly Asn Leu Tyr Gly Lys 275 280 285 Met Phe Ser Gln Thr Thr Ile Cys Arg Phe Glu Ala Leu Gln Leu Ser 290 295 300 Phe Lys Asn Met Cys Lys Leu Lys Pro Leu Leu Gln Arg Trp Leu Asn 305 310 315 320 Glu Ala Glu Asn Ser Glu Asn Pro Gln Asp Met Tyr Lys Ile Glu Arg 325 330 335 Val Phe Val Asp Thr Arg Lys Arg Lys Arg Arg Thr Ser Leu Glu Gly 340 345 350 Thr Val Arg Ser Ala Leu Glu Ser Tyr Phe Val Lys Cys Pro Lys Pro 355 360 365 Asn Thr Leu Glu Ile Thr His Ile Ser Asp Asp Leu Gly Leu Glu Arg 370 375 380 Asp Val Val Arg Val Trp Phe Cys Asn Arg Arg Gln Lys Gly Lys Arg 385 390 395 400 Leu Ala Leu Pro Phe Asp Asp Glu Cys Val Glu Ala Gln Tyr Tyr Glu 405 410 415 Gln Ser Pro Pro Pro Pro Pro His Met Gly Gly Thr Val Leu Pro Gly 420 425 430 Gln Gly Tyr Pro Gly Pro Ala His Pro Gly Gly Ala Pro Ala Leu Tyr 435 440 445 Met Pro Ser Leu His Arg Pro Asp Val Phe Lys Asn Gly Phe His Pro 450 455 460 Gly Leu Val Gly His Leu Asn Ser 465 470 5 543 DNA Danio rerio 5 taaaaaaaaa aaaaaaaaaa aaaactcaag atgtccctcg tcatcccaga gaagtttcag 60 cacatccttc gtgtcctcaa cacgaacatt gatggaagac gtaaaatcgc atttgccatc 120 accgctatta agggtgttgg aaggcgatat gcccatgtgg ttctgaggaa agctgatatt 180 gatttaaata agagggctgg agaactcact gaggatgagg ttgagagggt ggtgactatt 240 atgcagaacc ctcgccagta caaaatccca gactggttcc tgaacagaca gaaggacata 300 aaagatggga aatacagcca ggtccttgct aatggtctgg acaataaact gagagaagat 360 ctggagaggc tgaagaagat cagggctcac cgtggtctcc gtcacttctg gggtctgcgt 420 gtgcgtggtc agcacaccaa aacaactggt cgtcgcggtc gcactgtggg tgtgtccaag 480 aagaagtaaa tttacaaata aaggaaatca ttcagtaaaa aaaaaaaaaa aaaaaaaaaa 540 aaa 543 6 152 PRT Danio rerio 6 Met Ser Leu Val Ile Pro Glu Lys Phe Gln His Ile Leu Arg Val Leu 1 5 10 15 Asn Thr Asn Ile Asp Gly Arg Arg Lys Ile Ala Phe Ala Ile Thr Ala 20 25 30 Ile Lys Gly Val Gly Arg Arg Tyr Ala His Val Val Leu Arg Lys Ala 35 40 45 Asp Ile Asp Leu Asn Lys Arg Ala Gly Glu Leu Thr Glu Asp Glu Val 50 55 60 Glu Arg Val Val Thr Ile Met Gln Asn Pro Arg Gln Tyr Lys Ile Pro 65 70 75 80 Asp Trp Phe Leu Asn Arg Gln Lys Asp Ile Lys Asp Gly Lys Tyr Ser 85 90 95 Gln Val Leu Ala Asn Gly Leu Asp Asn Lys Leu Arg Glu Asp Leu Glu 100 105 110 Arg Leu Lys Lys Ile Arg Ala Pro Arg Gly Leu Arg His Phe Trp Gly 115 120 125 Leu Arg Val Arg Gly Gln His Thr Lys Thr Thr Gly Arg Arg Gly Arg 130 135 140 Thr Val Gly Val Ser Lys Lys Lys 145 150 7 938 DNA Danio rerio 7 aatggccgaa tacttggcgt ctattttcgg tacagaaaaa gacaaggtca actgctcttt 60 ctactttaaa attggagcct gtcgccatgg agaccgatgc tctagacttc ataataaacc 120 caccttcagt cagaccattg ccctgttgaa catctacagg aacccacaaa acacagccca 180 gtctgctgat ggcttaaatg ctgtcagtga tgtggagatg caggaacatt atgatgagtt 240 ctttgaggag gtcttcactg aaatggagga gaaatatgga gaggttgagg agatgaacgt 300 ctgcgataat ttaggagatc acttggtggg aaatgtctat gtgaagttcc gccgtgaaga 360 agatgcagag aaagcggtga ttaacctgaa caaccgctgg tttaatgggc agcccattca 420 tgcggagctc tcgcctgtca ctgacttcag agaggcctgt tgtcgacagt atgagatggg 480 agagtgcact cgaggaggct tctgcaattt catgcatctg aagccaatct caagggaact 540 aaggagagag ctgtacggcc gcagaaggaa gagacatcgg tcccgctctc gctcccgtga 600 acgacgctct cgctctaggg gacgcaataa aggagttggt ggtgcagcag caggcggtgg 660 tggtggcggc ggtggtggtg gtggaggtgg aggaggaaga gaccgagaga gacggcggtc 720 aagagataga gagcgttctg gaagatttta aaatttcctt ctttcttgaa atcaacattc 780 agtcaccaga cccttcagct ctctccctga tgaaaatgtg ataactgttg aaattgtttc 840 cttctttaat tgtcctctga tctttttatt caacgtttgt ttgcttttca ataaacaaac 900 accttagtcg tttcaaaaaa aaaaaaaaag ggcggccg 938 8 249 PRT Danio rerio 8 Met Ala Glu Tyr Leu Ala Ser Ile Phe Gly Thr Glu Lys Asp Lys Val 1 5 10 15 Asn Cys Ser Phe Tyr Phe Lys Ile Gly Ala Cys Arg His Gly Asp Arg 20 25 30 Cys Ser Arg Leu His Asn Lys Pro Thr Phe Ser Gln Thr Ile Ala Leu 35 40 45 Leu Asn Ile Tyr Arg Asn Pro Gln Asn Thr Ala Gln Ser Ala Asp Gly 50 55 60 Leu Asn Ala Val Ser Asp Val Glu Met Gln Glu His Tyr Asp Glu Phe 65 70 75 80 Phe Glu Glu Val Phe Thr Glu Met Glu Glu Lys Tyr Gly Glu Val Glu 85 90 95 Glu Met Asn Val Cys Asp Asn Leu Gly Asp His Leu Val Gly Asn Val 100 105 110 Tyr Val Lys Phe Arg Arg Glu Glu Asp Ala Glu Lys Ala Val Ile Asn 115 120 125 Leu Asn Asn Arg Trp Phe Asn Gly Gln Pro Ile His Ala Glu Leu Ser 130 135 140 Pro Val Thr Asp Phe Arg Glu Ala Cys Cys Arg Gln Tyr Glu Met Gly 145 150 155 160 Glu Cys Thr Arg Gly Gly Phe Cys Asn Phe Met His Leu Lys Pro Ile 165 170 175 Ser Arg Glu Leu Arg Arg Glu Leu Tyr Gly Arg Arg Arg Lys Arg His 180 185 190 Arg Ser Arg Ser Arg Ser Arg Glu Arg Arg Ser Arg Ser Arg Gly Arg 195 200 205 Asn Lys Gly Val Gly Gly Ala Ala Ala Gly Gly Gly Gly Gly Gly Gly 210 215 220 Gly Gly Gly Gly Gly Gly Gly Gly Arg Asp Arg Glu Arg Arg Arg Ser 225 230 235 240 Arg Asp Arg Glu Arg Ser Gly Arg Phe 245 9 2166 DNA Danio rerio 9 agcggaatca tcgcttatgt tatcctgctc cgagaagctg atatggagtc gctttattat 60 atttaagaca ctgctggact cttgattaat aatactggat tatcaatcaa ccttgaccgt 120 tggacaaggg aggatttaat cagaagcgat tgttattgcg tagcattttg gtaacttgga 180 tagttagcat ctttaagctg caagtctaga gtcatgtcgc taaattgcta aaggacacat 240 ttcatatcta cccaccaaaa tgccgttttt cttcctttaa ccgataaact tggattaatg 300 tggactaaag ccgaaaataa actgcccaga ctactttaac tgtcggatgc tttttgacgc 360 agtaatagtt agtaaataag ttagcattgc tagcatagtc gtgcagccgc tcttcaaata 420 gaaacttccg tgttgcacta cgaaagtccg ttatcgtatg tttttggggc tgtaaacccg 480 aagcgagtct tcctcgtgtt ttatttgcaa gtatcaggct tgaatgtgaa ctagcgtgag 540 atgcggtggt aaagaagagt tcaatgccgt gtgtgctgga ggatgatgcg gatgagctgg 600 atggtcacag tcattaacag aagaatgatg aagatcctga tcgccctcgc tctcattgcg 660 tacatcgcct ctgtctgggg aacatacgca aatatgaggt ctatacagga acacggagaa 720 atgaagattg agcagaggat agatgaggca gtgggacctc tcagggagaa gatacgggaa 780 ctcgagctta gttttaccca gaagtaccca ccagtgaaat tcttatccga gaaggaccga 840 aagaggattt tgatcaccgg tggggcagga ttcgtgggct cccacctcac cgataagctg 900 atgatggatg gtcatgaggt gactgtggtg gataatttct tcaccggccg caagcgcaat 960 gtagagcact ggattggcca tgagaacttc gagctcatca accatgatgt ggtggagccg 1020 ctctatattg aagttgacca gatatatcat ttggcctcgc ctgcatcgcc accgaactac 1080 atgtacaatc cgatcaagac actaaaaaca aatactattg gtactctcaa catgctagga 1140 ttggccaagc gggttggagc gcgcctgctt cttgcctcca cgtcagaagt gtacggagac 1200 ccagaggtgc acccccaaaa tgaggactac tggggtcacg tgaatccaat tggtccccgg 1260 gcctgttatg atgaaggaaa gcgtgttgct gagactatgt gttatgccta catgaaacag 1320 gaaggagtgg aggtgcgagt ggctcgcatc ttcaacacct tcgggtctcg catgcacatg 1380 aacgacgggc gagtggtcag caatttcatc ctgcaggctt tgcaaggaga ggcactgacg 1440 gtctacggtt cagggtctca aacaagagct ttccaatatg tgagtgacct ggtgaatgga 1500 ctggtgtctc tgatgaacag taacatcagt agccctgtta acctgggaaa tccagaggaa 1560 cacaccatct tggagtttgg cagctcatca agagtcttgt tgcgagccgg agtcatattc 1620 agttcctttc agaagcgcag gatgatccac agaggagacg gaccgacatc cgcaggccaa 1680 actgctgctg ggctgggaac ctgtggtccc actggaggaa ggcttgaaca aaaccatcca 1740 gtacttcagc cgagagctgg agcatcaagc caacaatcag tacatcccca aacccaaagc 1800 tgcccgcatg aaaaaaggaa gacccagaca caactgaaga cacggcttca tctcagggat 1860 gctgggatat ggaggacttc agacatccgg cagccacttt accaactccc actaacagat 1920 gcgtcgtttt ggcagagagg ggataagaga agtttttctg atgtttttca gtcgtatgtg 1980 cctcaaaatg atccaattta cagactttgc cttgcacttt ggtttgtgag tcgtctgtct 2040 gtctgtgtgt gtgtctgtta tttaaagcaa gcttaggtct tcatgtgcgt tatgtttaca 2100 ttctgtgctg ttttaattga tgtgatcagt tttttttttt ggattatcat tgaagctaaa 2160 ggtggg 2166 10 497 PRT Danio rerio 10 Met Met Arg Met Ser Trp Met Val Thr Val Ile Asn Arg Arg Met Met 1 5 10 15 Lys Ile Leu Ile Ala Leu Ala Leu Ile Ala Tyr Ile Ala Ser Val Trp 20 25 30 Gly Thr Tyr Ala Asn Met Arg Ser Ile Gln Glu His Gly Glu Met Lys 35 40 45 Ile Glu Gln Arg Ile Asp Glu Ala Val Gly Pro Leu Arg Glu Lys Ile 50 55 60 Arg Glu Leu Glu Leu Ser Phe Thr Gln Lys Tyr Pro Pro Val Lys Phe 65 70 75 80 Leu Ser Glu Lys Asp Arg Lys Arg Ile Leu Ile Thr Gly Gly Ala Gly 85 90 95 Phe Val Gly Ser His Leu Thr Asp Lys Leu Met Met Asp Gly His Glu 100 105 110 Val Thr Val Val Asp Asn Phe Phe Thr Gly Arg Lys Arg Asn Val Glu 115 120 125 His Trp Ile Gly His Glu Asn Phe Glu Leu Ile Asn His Asp Val Val 130 135 140 Glu Pro Leu Tyr Ile Glu Val Asp Gln Ile Tyr His Leu Ala Ser Pro 145 150 155 160 Ala Ser Pro Pro Asn Tyr Met Tyr Asn Pro Ile Lys Thr Leu Lys Thr 165 170 175 Asn Thr Ile Gly Thr Leu Asn Met Leu Gly Leu Ala Lys Arg Val Gly 180 185 190 Ala Arg Leu Leu Leu Ala Ser Thr Ser Glu Val Tyr Gly Asp Pro Glu 195 200 205 Val His Pro Gln Asn Glu Asp Tyr Trp Gly His Val Asn Pro Ile Gly 210 215 220 Pro Arg Ala Cys Tyr Asp Glu Gly Lys Arg Val Ala Glu Thr Met Cys 225 230 235 240 Tyr Ala Tyr Met Lys Gln Glu Gly Val Glu Val Arg Val Ala Arg Ile 245 250 255 Phe Asn Thr Phe Gly Ser Arg Met His Met Asn Asp Gly Arg Val Val 260 265 270 Ser Asn Phe Ile Leu Gln Ala Leu Gln Gly Glu Ala Leu Thr Val Tyr 275 280 285 Gly Ser Gly Ser Gln Thr Arg Ala Phe Gln Tyr Val Ser Asp Leu Val 290 295 300 Asn Gly Leu Val Ser Leu Met Asn Ser Asn Ile Ser Ser Pro Val Asn 305 310 315 320 Leu Gly Asn Pro Glu Glu His Thr Ile Leu Glu Phe Gly Ser Ser Ser 325 330 335 Arg Val Leu Leu Arg Ala Gly Val Ile Phe Ser Ser Phe Gln Lys Arg 340 345 350 Arg Met Ile His Arg Gly Asp Gly Pro Thr Ser Ala Gly Gln Thr Ala 355 360 365 Ala Gly Leu Gly Thr Cys Gly Pro Thr Gly Gly Arg Leu Glu Gln Asn 370 375 380 His Pro Val Leu Gln Pro Arg Ala Gly Ala Ser Ser Gln Gln Ser Val 385 390 395 400 His Pro Gln Thr Gln Ser Cys Pro His Glu Lys Arg Lys Thr Gln Thr 405 410 415 Gln Leu Lys Thr Arg Leu His Leu Arg Asp Ala Gly Ile Trp Arg Thr 420 425 430 Ser Asp Ile Arg Gln Pro Leu Tyr Gln Leu Pro Leu Thr Asp Ala Ser 435 440 445 Phe Trp Gln Arg Gly Asp Lys Arg Ser Phe Ser Asp Val Phe Gln Ser 450 455 460 Tyr Val Pro Gln Asn Asp Pro Ile Tyr Arg Leu Cys Leu Ala Leu Trp 465 470 475 480 Phe Val Ser Arg Leu Ser Val Cys Val Cys Val Cys Tyr Leu Lys Gln 485 490 495 Ala 11 1399 DNA Danio rerio 11 ctgattgagg gttaagagca agacagacgc gtgccattat cttcaacaca aactatctgc 60 aggattctgc aaaacctcaa gcatctccca gcccaccaat aaggttatca acaatggaga 120 tcgtatactc cgatatggaa acctcaagct gtgactactc cttttcgcac acggatgatg 180 aagactcgcg cagcagcctc caccccgcgt cccccgcgtc ctcctgcgga aaaccacctg 240 cgtctccagc cgggctccag cagaagaaaa ggcgcagggg gcgcgcgagg aacgaaacca 300 ctgtgcacgt cgtgaagaag aaccgcaggc tgaaggccaa cgaccgcgag aggaacagga 360 tgcacaacct taacgacgca ttggatgctt tgagaagcgt cctgcctgcg tttcctgacg 420 acacaaagct gaccaaaatt gagactctgc gcttcgctca caactacatc tgggcacttt 480 cggagaccat ccggatcgca gaccagaagc agggcaagtc aagagacggt ccgctgctgc 540 tccccggact aagctgcatg gcagatgcac ccagccccgg cagtgactct tgctcctggc 600 cgtcgggggc atcctcgtcg tcttcatcac cgtcttactg caactcagac ccgggcagcc 660 ccgcagccat ggacgatttt ggatacttgc aaaccgacgt agtgtacagc taccgcaact 720 tcgtgcctag catctattaa tgtgacttta agcgttctca tgtcggtata gaatctccag 780 aagtatgtaa tagtgtaaga ccactgttac ttttctatta gaaccaagag cacaacgtta 840 ggtattcact gtttgcctta atgacagtct gcaaatgaga atgttttatt tttgtacgtt 900 ttgatattga tgttgatgtt gatgcttttt tccaatacct ttgcactttt tgcattcctt 960 ggagcgtgac cagaaccccg ttagagaaaa ttataattta aaaagcaagg cagattggcc 1020 tttgctgtcc aaggtgaaaa gtcaatatta caatttgtag agcgataaag ctgaaaaacg 1080 agcgtgaaaa atcagtttca acgctgtgac aatagactga aaggacaaga caaagatctg 1140 gccttatgca ttatgtattt tgacccgaag aaaccctcat ctgtccgact gatccgtaca 1200 gttaaccagc tctcatgacg gctttcccac cacttttctt tttttattta ttcaacctgt 1260 tgcttcttta tgctattgta tttttattgt atataaagag atttattcta ttatgtattt 1320 acctcagaat ataaatctgg tgcaaacatg tttgtacgaa taaatatacg ttttctacta 1380 aaaaaaaaaa aaaaaaaaa 1399 12 208 PRT Danio rerio 12 Met Glu Ile Val Tyr Ser Asp Met Glu Thr Ser Ser Cys Asp Tyr Ser 1 5 10 15 Phe Ser His Thr Asp Asp Glu Asp Ser Arg Ser Ser Leu His Pro Ala 20 25 30 Ser Pro Ala Ser Ser Cys Gly Lys Pro Pro Ala Ser Pro Ala Gly Leu 35 40 45 Gln Gln Lys Lys Arg Arg Arg Gly Arg Ala Arg Asn Glu Thr Thr Val 50 55 60 His Val Val Lys Lys Asn Arg Arg Leu Lys Ala Asn Asp Arg Glu Arg 65 70 75 80 Asn Arg Met His Asn Leu Asn Asp Ala Leu Asp Ala Leu Arg Ser Val 85 90 95 Leu Pro Ala Phe Pro Asp Asp Thr Lys Leu Thr Lys Ile Glu Thr Leu 100 105 110 Arg Phe Ala His Asn Tyr Ile Trp Ala Leu Ser Glu Thr Ile Arg Ile 115 120 125 Ala Asp Gln Lys Gln Gly Lys Ser Arg Asp Gly Pro Leu Leu Leu Pro 130 135 140 Gly Leu Ser Cys Met Ala Asp Ala Pro Ser Pro Gly Ser Asp Ser Cys 145 150 155 160 Ser Trp Pro Ser Gly Ala Ser Ser Ser Ser Ser Ser Pro Ser Tyr Cys 165 170 175 Asn Ser Asp Pro Gly Ser Pro Ala Ala Met Asp Asp Phe Gly Tyr Leu 180 185 190 Gln Thr Asp Val Val Tyr Ser Tyr Arg Asn Phe Val Pro Ser Ile Tyr 195 200 205 13 1587 DNA Danio rerio 13 agctcctttt ggactattac acttccagat aaatagtatt gaagaaaaag tttttgagtc 60 gggccttgga tccaagtgat atttggcttg atgcatggct ggcatgttgc cttaaaaaaa 120 gttattttag gggtttcttc caaatcatgt acgttggata ccttttggag aaagaggcaa 180 gcatgtatca ccaaggagcg gttcgtcgat ctggcatcag tcttccacca cagaactttg 240 tttccacacc tcagtattca gattttacag gataccatca tgtgcccaat atggagacac 300 acgcacagtc ggcaggggca tggggcgctc cttatggcgc tccgagagag gactggggcg 360 cctacagcct cggacctcca aattctattt ctgcacctat gagcaattca tctccgggac 420 cagtttccta ttgctcgccc gattataaca ccatgcacgg ccctggatcg gcggttcttc 480 ctccgccacc tgaaaatatt cctgtggccc aactatcccc agagagagaa aggcgtaatt 540 cctatcagtg gatgagcaaa accgtccagt cgtcatcaac cggcaaaacg agaacgaagg 600 agaaataccg agtggtatac acagatcacc aaaggctaga gctggagaaa gagtttcatt 660 ttaatcgcta tatcacaatc agaagaaaat ccgaacttgc tgtaaacctc gggctttcag 720 agagacaggt taagatctgg tttcagaacc gcagagctaa ggaaaggaaa ttaatcaaaa 780 agaaattggg tcagtctgac ggaagcggag ggtcagtgca cagcgaccca ggctccgtca 840 gtcctctacc ggtaagggtc actgagtcct tcagatatac acggttctct gtatccagca 900 caaatgaacg ccttgccgtc catgaggaac atacagcaag ttactgtcac tcagtgaact 960 cgaacttgga tcatccaact ttaactgagc actccaacac gctgaaagga cagaaccaaa 1020 cgtggtcaaa aaggattaca aactattaat gaataaatga atgaataaat aaataagagg 1080 cgtgggttgt ggattacaac cgaagaatca cgtctcaaat gcgacgaaat gctggaggct 1140 cttgaaatat tcttttaata gaatagtttt taaaattaag aaaaaaaaac tacaaaaacc 1200 ttgttcgtct gatccagaga ctaaataggc tattttaatg gtttgtgttt tctcgaataa 1260 gatataaata tcttttttct gtatgtggtg agtgtatcta aatgtttgtt tgtatataaa 1320 atgagaacta caggtacgga gtttcatgga cgtgtaatga tccttaccct ggaactgtta 1380 aacttttatt gtctgtgaaa accaatgacc tgcgatgtca ggctgtttta tgcgccagtt 1440 tgtgtccaaa gcgcacgtcg ttcactttct tttcacaggg gtgatgtgca ggtgtcacat 1500 accagatacg ggtataacgt ttgatttcag caatgtgtaa ataaatgttt tgcaataact 1560 tttattgtgt cttcatgtat cattaaa 1587 14 300 PRT Danio rerio 14 Met Tyr Val Gly Tyr Leu Leu Glu Lys Glu Ala Ser Met Tyr His Gln 1 5 10 15 Gly Ala Val Arg Arg Ser Gly Ile Ser Leu Pro Pro Gln Asn Phe Val 20 25 30 Ser Thr Pro Gln Tyr Ser Asp Phe Thr Gly Tyr His His Val Pro Asn 35 40 45 Met Glu Thr His Ala Gln Ser Ala Gly Ala Trp Gly Ala Pro Tyr Gly 50 55 60 Ala Pro Arg Glu Asp Trp Gly Ala Tyr Ser Leu Gly Pro Pro Asn Ser 65 70 75 80 Ile Ser Ala Pro Met Ser Asn Ser Ser Pro Gly Pro Val Ser Tyr Cys 85 90 95 Ser Pro Asp Tyr Asn Thr Met His Gly Pro Gly Ser Ala Val Leu Pro 100 105 110 Pro Pro Pro Glu Asn Ile Pro Val Ala Gln Leu Ser Pro Glu Arg Glu 115 120 125 Arg Arg Asn Ser Tyr Gln Trp Met Ser Lys Thr Val Gln Ser Ser Ser 130 135 140 Thr Gly Lys Thr Arg Thr Lys Glu Lys Tyr Arg Val Val Tyr Thr Asp 145 150 155 160 His Gln Arg Leu Glu Leu Glu Lys Glu Phe His Phe Asn Arg Tyr Ile 165 170 175 Thr Ile Arg Arg Lys Ser Glu Leu Ala Val Asn Leu Gly Leu Ser Glu 180 185 190 Arg Gln Val Lys Ile Trp Phe Gln Asn Arg Arg Ala Lys Glu Arg Lys 195 200 205 Leu Ile Lys Lys Lys Leu Gly Gln Ser Asp Gly Ser Gly Gly Ser Val 210 215 220 His Ser Asp Pro Gly Ser Val Ser Pro Leu Pro Val Arg Val Thr Glu 225 230 235 240 Ser Phe Arg Tyr Thr Arg Phe Ser Val Ser Ser Thr Asn Glu Arg Leu 245 250 255 Ala Val His Glu Glu His Thr Ala Ser Tyr Cys His Ser Val Asn Ser 260 265 270 Asn Leu Asp His Pro Thr Leu Thr Glu His Ser Asn Thr Leu Lys Gly 275 280 285 Gln Asn Gln Thr Trp Ser Lys Arg Ile Thr Asn Tyr 290 295 300 15 1269 DNA Danio rerio 15 ccagtctatt ttcacggctc tgacactgtc acgatcaacc cttttcgagt aaatttaaaa 60 tctaaaggct ttcaggatta ttttcgagct gtatttgcaa agcagacgcc tttcctctgc 120 cgcaaaccga gctttatctc tcggcggagt ttaaaatccc cttaaagcag caggagcagg 180 atctgcagtc atggcgctca gcgatgccga cgtccagaaa cagatcaagc acatgatggc 240 tttcattgag caggaggcca atgaaaaagc cgaggagata gacgccaagg cggaggagga 300 gtttaacatt gagaaaggtc gactggttca gacccagcgc ttgaagatca tggagtatta 360 cgaaaagaag gagaaacaga tcgaacagca gaagaaaatt caaatgtcca acctgatgaa 420 tcaggccaga ctgaaggtcc tgaaggccag agatgacatg attgcggatt tactaaacga 480 cgcacgtcag cgactagcga atgtagccag agatccgtcc agatacgcgg ctctgatgga 540 cggactggtt ctgcagggtt tttaccagct gctggaacct aaagtgacca tccgttgccg 600 caaacaggat gtggggattg tgcaggccgc cgtgcaaaaa aacatctcca tctacaaagc 660 agcagtaaag aacaatctag aagtgcgcat cgaccaagac aacttcctct ctccagaaat 720 ctctggaggt attgagctct ataatgcgga cgggaaaatt aaagtggcaa acactctgga 780 gagccgactg gaactcatcg cacagcacat gatgcctgaa ataccagtcg ccctgtttgg 840 tgcgaaccag aaccgcaagt tcatggactg atggatcttt ccggagatcg gcagtgtttt 900 gatgttgtat ttttattttg atgttaaaaa tccattcctg cgtctctatc tgtactgtat 960 ttaagaagac aacacctgtg tgtgtgtgtg tgtgtgcgta tgattcatca ttgctgtata 1020 aagatcaaaa tagtgtcctg cacatttctg tgtttgatta tgaataaatt aaagctccgt 1080 tatgttccta taaagtctta aagtgatgca taaccctcgc ttcatgaaga aaatatgact 1140 ttttttttat ttgcattcct gtaatgagga gcatgatctg tatatctgaa gcaaacgtgt 1200 ctgtcgatga tgcaaagaga ctatattgat gtgaaaacta taatgaaaac ccatatggcc 1260 cagaaaaaa 1269 16 226 PRT Danio rerio 16 Met Ala Leu Ser Asp Ala Asp Val Gln Lys Gln Ile Lys His Met Met 1 5 10 15 Ala Phe Ile Glu Gln Glu Ala Asn Glu Lys Ala Glu Glu Ile Asp Ala 20 25 30 Lys Ala Glu Glu Glu Phe Asn Ile Glu Lys Gly Arg Leu Val Gln Thr 35 40 45 Gln Arg Leu Lys Ile Met Glu Tyr Tyr Glu Lys Lys Glu Lys Gln Ile 50 55 60 Glu Gln Gln Lys Lys Ile Gln Met Ser Asn Leu Met Asn Gln Ala Arg 65 70 75 80 Leu Lys Val Leu Lys Ala Arg Asp Asp Met Ile Ala Asp Leu Leu Asn 85 90 95 Asp Ala Arg Gln Arg Leu Ala Asn Val Ala Arg Asp Pro Ser Arg Tyr 100 105 110 Ala Ala Leu Met Asp Gly Leu Val Leu Gln Gly Phe Tyr Gln Leu Leu 115 120 125 Glu Pro Lys Val Thr Ile Arg Cys Arg Lys Gln Asp Val Gly Ile Val 130 135 140 Gln Ala Ala Val Gln Lys Asn Ile Ser Ile Tyr Lys Ala Ala Val Lys 145 150 155 160 Asn Asn Leu Glu Val Arg Ile Asp Gln Asp Asn Phe Leu Ser Pro Glu 165 170 175 Ile Ser Gly Gly Ile Glu Leu Tyr Asn Ala Asp Gly Lys Ile Lys Val 180 185 190 Ala Asn Thr Leu Glu Ser Arg Leu Glu Leu Ile Ala Gln His Met Met 195 200 205 Pro Glu Ile Pro Val Ala Leu Phe Gly Ala Asn Gln Asn Arg Lys Phe 210 215 220 Met Asp 225 17 1740 DNA Danio rerio 17 ggccttgttg caatttgaca gcagaggaag cagctcctta aatataaaat ctgaagatgg 60 acatccgagg ggcagtggac gctgccgtcc ccaccaatat cattgcagcg aaggccgcag 120 aagtccgagc aaacaaggtc aactggcagt cctaccttca gggccaaatg atctccggcg 180 aggactgcga gttcattaag aagtttgagg ttgctggatc tgaggacaag caagcaattc 240 tgacaaacga aggacatcag tgtgcaaaga ccttcctcaa ccttatggct catatatcta 300 aggagcaaac cgttcagtac atcctgaccc tgattgatga cacactgcag gaaaatcatc 360 agagagtgaa tatcttcttt gattatgcaa agaagactaa gaatacagcc tggtcatact 420 tcctcccaat gctgaaccgt caagagctct tcactgttca catggctgcc agaatcatag 480 ccaaactggc tgcctgggga cgtgacctga tggaaggcag tgacttgaac tactacttca 540 actggattaa gacccagctc agctctcaga gctcccagta cgttcagtgt gtcgctggct 600 gcctacagtt gatgttgaga gtcaatgaat acagatttgc ctgggtggag gccgatggag 660 tgaactgcat cacggcagtg ctgagcaata agtgtggctt ccagctgcag tatcagatga 720 tcttctgcgt gtggctcctg gcattcagcc cgcagctctg cgaacagctg cggcgctaca 780 acgtggtacc agccctctcc gacatcctcc aagagtctgt caaagagaag gtcactcgca 840 taattctggc tgctttcagg aacctgctgg agaaatcagc cgaaagagag actcgtcaag 900 agtatgcgct tgccatgatc cagtgcaaag tgctaaaaca gctggagaac ctggaccagc 960 agaaatacga tgatgaggac atcaccgatg atattaaatt cctcttggaa agactgggag 1020 agagtgttca agatctcagt tcgtttgatg agtacagctc cgagctgaag tctggacgcc 1080 tggaatggag tccagtgcac aagtcagaga agttctggag ggaaaacgct gttcgcctga 1140 atgagaagaa ctacgagctc ctgaagatac tgaccaggct gttggaagtg tctgatgacc 1200 ctcaggttat tgccgtcgca gctcacgacg tgggggaata cgtgagacat taccctcgcg 1260 gcaagagggt gatcgaacag ctgggtggta aacaactagt gatgaaccac atgcatcatg 1320 aagaccagct tgtccgctat aatgccctcc tggccgtcca gaagctcatg gtgcataact 1380 gggagtacct ggggaagcag ttgcagtcca ccgagcagac gactccggct gtggctgcac 1440 gaagctgaac tgccctgcgt ttctgtatga gcctgtgtgt ctgatccagt gtcattggtg 1500 tgaatgtaaa tctgtgttat atatataatg aacaaggtaa accaaacggt tgttgtattg 1560 atccacaaat ttatgattta actcttgtgc ttttgtcgtt gtgttattgt gtattgtgcg 1620 tcctaataat aaaatcacgt caagtgatca ggtcacagag taaaatgaaa tgttatttat 1680 ggcttaagtg taaaacgatg agtatgcatt ccaataaaga atctaccaca acaaaaaaaa 1740 18 463 PRT Danio rerio 18 Met Asp Ile Arg Gly Ala Val Asp Ala Ala Val Pro Thr Asn Ile Ile 1 5 10 15 Ala Ala Lys Ala Ala Glu Val Arg Ala Asn Lys Val Asn Trp Gln Ser 20 25 30 Tyr Leu Gln Gly Gln Met Ile Ser Gly Glu Asp Cys Glu Phe Ile Lys 35 40 45 Lys Phe Glu Val Ala Gly Ser Glu Asp Lys Gln Ala Ile Leu Thr Asn 50 55 60 Glu Gly His Gln Cys Ala Lys Thr Phe Leu Asn Leu Met Ala His Ile 65 70 75 80 Ser Lys Glu Gln Thr Val Gln Tyr Ile Leu Thr Leu Ile Asp Asp Thr 85 90 95 Leu Gln Glu Asn His Gln Arg Val Asn Ile Phe Phe Asp Tyr Ala Lys 100 105 110 Lys Thr Lys Asn Thr Ala Trp Ser Tyr Phe Leu Pro Met Leu Asn Arg 115 120 125 Gln Glu Leu Phe Thr Val His Met Ala Ala Arg Ile Ile Ala Lys Leu 130 135 140 Ala Ala Trp Gly Arg Asp Leu Met Glu Gly Ser Asp Leu Asn Tyr Tyr 145 150 155 160 Phe Asn Trp Ile Lys Thr Gln Leu Ser Ser Gln Ser Ser Gln Tyr Val 165 170 175 Gln Cys Val Ala Gly Cys Leu Gln Leu Met Leu Arg Val Asn Glu Tyr 180 185 190 Arg Phe Ala Trp Val Glu Ala Asp Gly Val Asn Cys Ile Thr Ala Val 195 200 205 Leu Ser Asn Lys Cys Gly Phe Gln Leu Gln Tyr Gln Met Ile Phe Cys 210 215 220 Val Trp Leu Leu Ala Phe Ser Pro Gln Leu Cys Glu Gln Leu Arg Arg 225 230 235 240 Tyr Asn Val Val Pro Ala Leu Ser Asp Ile Leu Gln Glu Ser Val Lys 245 250 255 Glu Lys Val Thr Arg Ile Ile Leu Ala Ala Phe Arg Asn Leu Leu Glu 260 265 270 Lys Ser Ala Glu Arg Glu Thr Arg Gln Glu Tyr Ala Leu Ala Met Ile 275 280 285 Gln Cys Lys Val Leu Lys Gln Leu Glu Asn Leu Asp Gln Gln Lys Tyr 290 295 300 Asp Asp Glu Asp Ile Thr Asp Asp Ile Lys Phe Leu Leu Glu Arg Leu 305 310 315 320 Gly Glu Ser Val Gln Asp Leu Ser Ser Phe Asp Glu Tyr Ser Ser Glu 325 330 335 Leu Lys Ser Gly Arg Leu Glu Trp Ser Pro Val His Lys Ser Glu Lys 340 345 350 Phe Trp Arg Glu Asn Ala Val Arg Leu Asn Glu Lys Asn Tyr Glu Leu 355 360 365 Leu Lys Ile Leu Thr Arg Leu Leu Glu Val Ser Asp Asp Pro Gln Val 370 375 380 Ile Ala Val Ala Ala His Asp Val Gly Glu Tyr Val Arg His Tyr Pro 385 390 395 400 Arg Gly Lys Arg Val Ile Glu Gln Leu Gly Gly Lys Gln Leu Val Met 405 410 415 Asn His Met His His Glu Asp Gln Leu Val Arg Tyr Asn Ala Leu Leu 420 425 430 Ala Val Gln Lys Leu Met Val His Asn Trp Glu Tyr Leu Gly Lys Gln 435 440 445 Leu Gln Ser Thr Glu Gln Thr Thr Pro Ala Val Ala Ala Arg Ser 450 455 460 19 1439 DNA Danio rerio 19 ttcttcctct ccgccggcga ccagcctcct gtcgggttcg tgaccgtctc tcctgctttt 60 gtaagacctg tcagccaaaa gaagacctca ctcagagtct aaacctgcta gactgaagct 120 ttcctccctt ttctgccttt agttttagtc tcccgaacgc gcgagaagaa tggacaccca 180 gaacccccag tactctccct tcctcgcagt gatgggtgcc tcctcagcga tggtgttcag 240 cgcgttgggt gctgcatacg ggactgctaa gagcggcaca ggcattgctg ccatgtcagt 300 gatgcggcca gagctgatca tgaagtccat cattcccgtg gtcatggcgg gtatcattgc 360 catctacggc ctggtggtgg ctgtactcat tgccaacaat atcggtgata aaatctcgct 420 ctacaagagt atcctgcatc tgggagcggg tctgagtgtg ggtctgagcg gtctcgctgc 480 cggcttcgcc atcgggatcg tgggagacgc aggtgtgaga ggaacggccc agcagccgcg 540 tctgttcgtg ggcatgatcc tcatcttgat ctttgcagag gtgctggggc tctacggcct 600 gatcgtcgcc ctcatcctgt ccactaaggg ttaatcgttg ctcctatcct ctcttacaca 660 tgaacaaaca cacacacaga cacacacacc aacacaaatg atatacaacg agtaaacttc 720 accgcctgaa taacgacgag aggaattttt gcgtgcatgc gtgtatcgtg agtgtgctgt 780 ttgcgatgag tgagcgtgag attgatcttc ctctctgtgt aaagaagccc gatgtgtgtg 840 tgtttttctt tcccccctgt aaatgcgctg tgtagctatc gctctcgtcc cgtgtgtgtg 900 tgtgtgcagt ctatacacct caagctttgc cagtggaacc catgtaccag cctgtggcac 960 aatcaggctc agagtctatc tctctcgctg aataccagat atcccctgtt ccaggacaga 1020 ttgtttccca gcctcaaatg cagcctggta gtctgtctct cactcagtcc agctcttctt 1080 caggcatgcc attgcagtcc agcttccagc ctcttgtaca atcaagttat gagactgtct 1140 tacagcctgg atttcaaact acttcagagc ttgtgccatc aagtcaccag tctacaagtt 1200 cccaagtctt gagtggtcca gaagtttcaa accatgagtc tgtgtttcag ccagtgcaac 1260 ctgaatttgc aattccacaa caggttgaaa acaagttggt tacaggtcaa aataatcctg 1320 gacctgctca aaatgggcat tcatcgcaga agttgttcag gctgttgcaa caagtgaagt 1380 cttaggtgtt tgctgtttcc tgagtttgct ttctgacttc caataaaatg tttaaaaag 1439 20 154 PRT Danio rerio 20 Met Asp Thr Gln Asn Pro Gln Tyr Ser Pro Phe Leu Ala Val Met Gly 1 5 10 15 Ala Ser Ser Ala Met Val Phe Ser Ala Leu Gly Ala Ala Tyr Gly Thr 20 25 30 Ala Lys Ser Gly Thr Gly Ile Ala Ala Met Ser Val Met Arg Pro Glu 35 40 45 Leu Ile Met Lys Ser Ile Ile Pro Val Val Met Ala Gly Ile Ile Ala 50 55 60 Ile Tyr Gly Leu Val Val Ala Val Leu Ile Ala Asn Asn Ile Gly Asp 65 70 75 80 Lys Ile Ser Leu Tyr Lys Ser Ile Leu His Leu Gly Ala Gly Leu Ser 85 90 95 Val Gly Leu Ser Gly Leu Ala Ala Gly Phe Ala Ile Gly Ile Val Gly 100 105 110 Asp Ala Gly Val Arg Gly Thr Ala Gln Gln Pro Arg Leu Phe Val Gly 115 120 125 Met Ile Leu Ile Leu Ile Phe Ala Glu Val Leu Gly Leu Tyr Gly Leu 130 135 140 Ile Val Ala Leu Ile Leu Ser Thr Lys Gly 145 150 21 3553 DNA Danio rerio 21 gccaggatat tcatcagctg acccaagaca gacatgcaac caacacagct cggtttctca 60 gatctgtcat tcctttttca tttataaaaa ttcccttggc gcgtaatttt gttttatttt 120 acaccaatag ggtgtccatt atgttttaat gagcacgcaa agaggacgat gacagtgaca 180 gcttgtgaac aggacattta acgttagctc gcatgctagc gtgctatggg tggaaacagc 240 tggtagggat actgtacatc tttacctgca atgtaaacac aggttcacta tttgctgtac 300 atcgtcgtgt gaatatttga tcatttgcaa tggcatctat tctggatcaa tatgaagatt 360 cgcagaacat ccgtcaacac agtcgtatgt ccacggccaa catcggcatc actcattcag 420 ggtttgtaaa tgtaagattg gaagaagaaa aacccatttt caccaagcag cgcatcgatt 480 tctctcctcc cgaaaaaatc aaccaatttt cggtttgcaa caatcagtta tgcatgagct 540 tgggaaaaga caccctgctg agaattgatc tcggaaaacc tgatcagcca aatcagattg 600 aattgggaag aaaagatgat agcaaagtac acagactttt cctagatcct acagggtcac 660 atttagtcat ctgcctcact acaaatgaat gtgtgtacct aaatagaaac actcagaagg 720 tccgaggtct gtcaaggtgg agaggccact tgattgaaag cataggttgg aacaaattaa 780 tcggctctga gaccaacact ggccccattc tggtcgggac cagccaggga atcatctttg 840 aagctgagat ctctgcctca gagggcagtc tgttcaacac caacccggac cagtacttca 900 gacaggttca ttacctcgag gaggatggca agccagctcc ggtttgttgc ctggaggttg 960 aacgtggttt ggaaactaaa tactttatca ttgcaaccac acgaaaacgg ctgttccagt 1020 ttgtaggaaa gctggcagaa ggatctgagc agcagggctt cagctccatc tttgcccaga 1080 accaggatct tctacccagt tccaggaatt tccctgtcaa tatgggctat agtgagatta 1140 cattttacac atcaaagctg cgtagtcgcc ccaagacctt cgcctggatg atggggaatg 1200 gcgttttgta cgggcaactg gactatgtgc gccccgactc tttactcagt gatgttcaag 1260 tgtgggaata cacgcaagac attgatttaa acttcgtcaa acccatctct atagtcctca 1320 cccagttcca cttcttgctc ctgctccctg accgtgtccg agggatctgc accctgaatg 1380 ggcaggttgt gcatgaagat gtgtttcccg aaaagttcgg tacactacaa aagatgatca 1440 aagatcccat aacaggactt gtgtggatct acactgagaa agcagtgttt cgatatcaca 1500 tccagaaaga agcacgggac gtttggcaga tgtacatgaa catgaataag tttgatctag 1560 ccaaagagta ctgcaaagat cgaccagaat gcttagacat ggttctcgca aaagaggctg 1620 agcactgctt tcaaaacaag cgctacctag agagcgccaa atgctatgcg ctcactcaaa 1680 attactttga ggagattgcg ctgaagttta ttgaggccaa gcaggaggag gccttgaaag 1740 agtttctgat taagaagctg gtcaatctca agcccagtga gaagactcag atcaccctgc 1800 tggtgacctg gttgaccgaa ctctatctga accggctcgg tcaactggag gccgacgaag 1860 gcaagcaaca tctatttctg gagacccgtg aagagtttcg tacttttctt aaaagcccca 1920 aacataaaga ttgcttctac aacaatcgca gcaccatcta cgacctgctc gctagtcacg 1980 gcgacgtgga caacatggtg tatttctcgg tcatcatgca agattacgag cgtgtcattt 2040 ctcattactg ccagcatgac gactacagcg cagctttaga tgttctctcc aaacactgtg 2100 acgacaagct cttctacaag ttctccccag tcctcatgca gcacatcccc aaaaaggtgg 2160 tcgatgcatg gattcagatg ggaaaccggt tggatccgaa gaatctgatc ccagcgcttg 2220 tgaactacag ccagatgggc agcatgcagc aaataaacga gaccatccgg tacatggagt 2280 tctgtgtgta tgagctggac gtcaaagaag aggccattca caattaccta ctctcgcttt 2340 atgccaaaca caagccagac gctctgttgt ggtacctcga acaagccgga actcacgtct 2400 ccgatattca ttacgacttg aaatatgcct tgcgcctttg ctctgaacat ggatacctgc 2460 aggcctgcgt cttggtttac aagattatgg agctgtatga agaagctgtg gatcttgctc 2520 taaaagtgga tgtggatttg gccaagtctt gcgccgacct ttcagaggac gatgaggagt 2580 taagaaagaa gctgtggctg aagatcgctc gccatgtagt gcaggaggag aaggatgtca 2640 agaaggctat gaactgcctg tccagttgca acctgcttaa gatcgaagac atcttgcctt 2700 tctttccaga ctttgtcaca atagaccact tcaaggaggc gatatgcagc tctttggaag 2760 agtacaacaa acacatcgaa gagctgaaac aagaaatgga ggaggccaca gaaagtgcca 2820 aacgcatccg cgaagacatc caagagatga ggaacaagta tggcgtggtt gagtctcagg 2880 aaaaatgcgc tacttgtgat ttccccctgc tcaaccgacc tttctatctt ttcctgtgcg 2940 gacacatgtt ccattatgat tgcctcctgc aggaagtcat tcctcacctc tctgtctata 3000 agcagaataa attggatgag ctacagaaga aactggcagc gaccacgcag accaccaagg 3060 ctcggcacaa gcctagagag gaggacacgg tcagcttagg caaaggccag ggaagcaggg 3120 agcagatcaa atcagacatt gatgatatta tagcgtgcga gtgcgtgtac tgcggtgagc 3180 tgatgatcaa gtccattgac aagccgttca tagatccgca gaagttcgat caagagatgt 3240 ccagctggct ctgaaggaga gacgcttctg cttgagctgc acaacacatt tattatacac 3300 attgtacatt tgcttaacat gtaaccggat gtgcattgta aaatttttac attctcaatt 3360 aatgaacatt tatatgattt tgtcaacaaa tatgcactgt acaagtgttc ttcgttgatt 3420 ttatgtgttt gcttccttag atcaaactta aaaatgcact tgttttcatt ttgcataaaa 3480 gtaatgcgaa taatatcttt tttatgtgcg cagctaagaa gaataaatga ttgccttttg 3540 actggaaaaa aaa 3553 22 974 PRT Danio rerio 22 Met Ala Ser Ile Leu Asp Gln Tyr Glu Asp Ser Gln Asn Ile Arg Gln 1 5 10 15 His Ser Arg Met Ser Thr Ala Asn Ile Gly Ile Thr His Ser Gly Phe 20 25 30 Val Asn Val Arg Leu Glu Glu Glu Lys Pro Ile Phe Thr Lys Gln Arg 35 40 45 Ile Asp Phe Ser Pro Pro Glu Lys Ile Asn Gln Phe Ser Val Cys Asn 50 55 60 Asn Gln Leu Cys Met Ser Leu Gly Lys Asp Thr Leu Leu Arg Ile Asp 65 70 75 80 Leu Gly Lys Pro Asp Gln Pro Asn Gln Ile Glu Leu Gly Arg Lys Asp 85 90 95 Asp Ser Lys Val His Arg Leu Phe Leu Asp Pro Thr Gly Ser His Leu 100 105 110 Val Ile Cys Leu Thr Thr Asn Glu Cys Val Tyr Leu Asn Arg Asn Thr 115 120 125 Gln Lys Val Arg Gly Leu Ser Arg Trp Arg Gly His Leu Ile Glu Ser 130 135 140 Ile Gly Trp Asn Lys Leu Ile Gly Ser Glu Thr Asn Thr Gly Pro Ile 145 150 155 160 Leu Val Gly Thr Ser Gln Gly Ile Ile Phe Glu Ala Glu Ile Ser Ala 165 170 175 Ser Glu Gly Ser Leu Phe Asn Thr Asn Pro Asp Gln Tyr Phe Arg Gln 180 185 190 Val His Tyr Leu Glu Glu Asp Gly Lys Pro Ala Pro Val Cys Cys Leu 195 200 205 Glu Val Glu Arg Gly Leu Glu Thr Lys Tyr Phe Ile Ile Ala Thr Thr 210 215 220 Arg Lys Arg Leu Phe Gln Phe Val Gly Lys Leu Ala Glu Gly Ser Glu 225 230 235 240 Gln Gln Gly Phe Ser Ser Ile Phe Ala Gln Asn Gln Asp Leu Leu Pro 245 250 255 Ser Ser Arg Asn Phe Pro Val Asn Met Gly Tyr Ser Glu Ile Thr Phe 260 265 270 Tyr Thr Ser Lys Leu Arg Ser Arg Pro Lys Thr Phe Ala Trp Met Met 275 280 285 Gly Asn Gly Val Leu Tyr Gly Gln Leu Asp Tyr Val Arg Pro Asp Ser 290 295 300 Leu Leu Ser Asp Val Gln Val Trp Glu Tyr Thr Gln Asp Ile Asp Leu 305 310 315 320 Asn Phe Val Lys Pro Ile Ser Ile Val Leu Thr Gln Phe His Phe Leu 325 330 335 Leu Leu Leu Pro Asp Arg Val Arg Gly Ile Cys Thr Leu Asn Gly Gln 340 345 350 Val Val His Glu Asp Val Phe Pro Glu Lys Phe Gly Thr Leu Gln Lys 355 360 365 Met Ile Lys Asp Pro Ile Thr Gly Leu Val Trp Ile Tyr Thr Glu Lys 370 375 380 Ala Val Phe Arg Tyr His Ile Gln Lys Glu Ala Arg Asp Val Trp Gln 385 390 395 400 Met Tyr Met Asn Met Asn Lys Phe Asp Leu Ala Lys Glu Tyr Cys Lys 405 410 415 Asp Arg Pro Glu Cys Leu Asp Met Val Leu Ala Lys Glu Ala Glu His 420 425 430 Cys Phe Gln Asn Lys Arg Tyr Leu Glu Ser Ala Lys Cys Tyr Ala Leu 435 440 445 Thr Gln Asn Tyr Phe Glu Glu Ile Ala Leu Lys Phe Ile Glu Ala Lys 450 455 460 Gln Glu Glu Ala Leu Lys Glu Phe Leu Ile Lys Lys Leu Val Asn Leu 465 470 475 480 Lys Pro Ser Glu Lys Thr Gln Ile Thr Leu Leu Val Thr Trp Leu Thr 485 490 495 Glu Leu Tyr Leu Asn Arg Leu Gly Gln Leu Glu Ala Asp Glu Gly Lys 500 505 510 Gln His Leu Phe Leu Glu Thr Arg Glu Glu Phe Arg Thr Phe Leu Lys 515 520 525 Ser Pro Lys His Lys Asp Cys Phe Tyr Asn Asn Arg Ser Thr Ile Tyr 530 535 540 Asp Leu Leu Ala Ser His Gly Asp Val Asp Asn Met Val Tyr Phe Ser 545 550 555 560 Val Ile Met Gln Asp Tyr Glu Arg Val Ile Ser His Tyr Cys Gln His 565 570 575 Asp Asp Tyr Ser Ala Ala Leu Asp Val Leu Ser Lys His Cys Asp Asp 580 585 590 Lys Leu Phe Tyr Lys Phe Ser Pro Val Leu Met Gln His Ile Pro Lys 595 600 605 Lys Val Val Asp Ala Trp Ile Gln Met Gly Asn Arg Leu Asp Pro Lys 610 615 620 Asn Leu Ile Pro Ala Leu Val Asn Tyr Ser Gln Met Gly Ser Met Gln 625 630 635 640 Gln Ile Asn Glu Thr Ile Arg Tyr Met Glu Phe Cys Val Tyr Glu Leu 645 650 655 Asp Val Lys Glu Glu Ala Ile His Asn Tyr Leu Leu Ser Leu Tyr Ala 660 665 670 Lys His Lys Pro Asp Ala Leu Leu Trp Tyr Leu Glu Gln Ala Gly Thr 675 680 685 His Val Ser Asp Ile His Tyr Asp Leu Lys Tyr Ala Leu Arg Leu Cys 690 695 700 Ser Glu His Gly Tyr Leu Gln Ala Cys Val Leu Val Tyr Lys Ile Met 705 710 715 720 Glu Leu Tyr Glu Glu Ala Val Asp Leu Ala Leu Lys Val Asp Val Asp 725 730 735 Leu Ala Lys Ser Cys Ala Asp Leu Ser Glu Asp Asp Glu Glu Leu Arg 740 745 750 Lys Lys Leu Trp Leu Lys Ile Ala Arg His Val Val Gln Glu Glu Lys 755 760 765 Asp Val Lys Lys Ala Met Asn Cys Leu Ser Ser Cys Asn Leu Leu Lys 770 775 780 Ile Glu Asp Ile Leu Pro Phe Phe Pro Asp Phe Val Thr Ile Asp His 785 790 795 800 Phe Lys Glu Ala Ile Cys Ser Ser Leu Glu Glu Tyr Asn Lys His Ile 805 810 815 Glu Glu Leu Lys Gln Glu Met Glu Glu Ala Thr Glu Ser Ala Lys Arg 820 825 830 Ile Arg Glu Asp Ile Gln Glu Met Arg Asn Lys Tyr Gly Val Val Glu 835 840 845 Ser Gln Glu Lys Cys Ala Thr Cys Asp Phe Pro Leu Leu Asn Arg Pro 850 855 860 Phe Tyr Leu Phe Leu Cys Gly His Met Phe His Tyr Asp Cys Leu Leu 865 870 875 880 Gln Glu Val Ile Pro His Leu Ser Val Tyr Lys Gln Asn Lys Leu Asp 885 890 895 Glu Leu Gln Lys Lys Leu Ala Ala Thr Thr Gln Thr Thr Lys Ala Arg 900 905 910 His Lys Pro Arg Glu Glu Asp Thr Val Ser Leu Gly Lys Gly Gln Gly 915 920 925 Ser Arg Glu Gln Ile Lys Ser Asp Ile Asp Asp Ile Ile Ala Cys Glu 930 935 940 Cys Val Tyr Cys Gly Glu Leu Met Ile Lys Ser Ile Asp Lys Pro Phe 945 950 955 960 Ile Asp Pro Gln Lys Phe Asp Gln Glu Met Ser Ser Trp Leu 965 970 23 3291 DNA Danio rerio 23 acgcgggata tcgttcagtc cgacagggaa gtgacctaac tttctcggag caggtgtctt 60 tggaggagct cacacttgga ggcgtgagga ggggaaagat cacaataccg catttccctc 120 tctagacctc ctaaatattt agatgtttgc aaacatggtg tccaagttga catctttgca 180 acaggagctt ctgagcgcct tgttggactc tggagttacc aaagatgtgc ttctgcaagc 240 tttggaggac ctggacccga gcccgagcgc ttttggagtt aaactggaca gtctgcagat 300 gtccccgtcc ggctccaaac tcagcgatac ggattcaaag ccggtgtttc atacgctcac 360 taatggacac agtaaaggga agttatccgg agatgaaggc tccgaggacg gggatgatta 420 tgacacaccg ccaattctca aagagcttca gtctcaaaac accgaggaag cggccgagca 480 gagggcagaa atagagcgaa tgttggcgga ggacccgtgg cgcgcggccc gcatgatcaa 540 aggctacatg cagcagcaca atatccctca gcgcgaggtg gtggatgtga cgggcctgaa 600 tcagtcgcac ctgtcgcagc acctgaataa aggcacgcct atgaaaacgc agaaacgcgc 660 ggcgctctac acctggtatg tgaggaaaca gcgggagatc ttgcgacaat tcaaccaggc 720 cacacaaggc tctggcgcca ccatgttaga caaaggaaat caggatcagg tactcctttt 780 tttctcagag tttagtcagt ctgggcaagg tatggtccag ccaggtgatg atgctgccat 840 tgagcctgct tgcaagaagc tcagacgcaa ccgtttcaaa tgggggcctg catcccaaca 900 aatcctttac caggcttatg agcggcagaa gaaccccagc aaagaggagc gagaggcatt 960 ggtggaggag tgcaaccggg ctgagtgcct ccaaagagga gtgtctccat cgaaagctca 1020 tggacttggc tccaacctgg tcacagaggt gcgagtctac aactggtttg ctaacagaag 1080 aaaggaagaa gccttcagac aaaagttggc tatggatgcc tacagtggcc cagcacatag 1140 cctaaactcc ctcctttcac atagttcccc gcatcaccca caaactagca gctccccacc 1200 aagcaagatg caaggtgtcc ggtacagcca acaaggtcca ggtgaagtca cttcttcaac 1260 aacgatcaat caccatagca gcaatgccat gtcaaccagc cagtcagtgt tacagcaggt 1320 ctctccaggg gcgctggacc ccagtcacag cctcctgtca cctgatgcca agatgatctc 1380 ggtatcaggt ggaggtcttc ctccagtaag caccctgacc aacatccacg catcccacca 1440 cgtgcaccaa caaaccccta acctcataat gccactctct ggagtcatgg ccattgcaca 1500 aagtttgaac acatcacaag cacagacggt gcctgtcata aacagtgttg caggcagtct 1560 ggcagcactg cagccagtac agttctctca gcagttaaat agccaacacc agcagctcat 1620 gcaacagtca tctggacaca tgagccaaca gtccttcatg gcctctgtct cacactcaca 1680 catgtaccca cacaagcagg agccacccca gtattctcat tcgtctcggt ttcccccggc 1740 catggtggtg acagacgcca acagcctcag tacactgagc tcaatgtcct ccagcaaaca 1800 gtgtccacta caggcctggt gaatgtatac acctgacttc ttcgatgcca agcaacagga 1860 gcacattgtc ccaacctcca ttccccttca tcgcacctat atccatggca acccataata 1920 tgtgcgagcg gcacgctgga tgatcagtgg gtcaataaac cacgagctga aggcaggact 1980 gatgagaatg aacactgttg gggtgtttgg gttcacagtt gggtgactag aagtctggct 2040 cggatgatca acatggcgcc caatttagtg acagtatttt acttctcatt gatggacaaa 2100 ctaaattaag gaacgcacat gatccatatg aacaaacaaa tacgaaacgc atacaacaca 2160 gtcagatgaa gatgcaaaga tggagcggga agacctggaa tccaggaata agagcacaaa 2220 atgtgtgtgt atgtaaaact ttacaatcaa actgtgtgga tccaatgaca atcagacaat 2280 gttagacgca tcgttaatgt aaaataaccg atgcaaagac gtcttccttg agatatcaat 2340 gcttagaaaa tgttctgtac agttgggtgt aaatacgagc aattactata tcattattat 2400 tattattatt atttcattac tatgaaggat gtaaatattc agttttatct gtctaaacat 2460 gtaaactgtg attgtgaatg tgacaatcta aggaacaggc atgatgtatt ggaatatgca 2520 gtgtgtcaga gagaaggatt tcaaaacttt tcagaaagct tcgttgacgt cagatcttaa 2580 aagagagaca aatgtaaaag cacattcaac aaaacgttga acctattgat ccttctgtaa 2640 gccttgtttt aaaatagaca ctgttgtcaa tgttgactgc cattagtgtg gcgtcagata 2700 gataagagtc atctttctgt gccagccaaa gaaatactga attaggaagt ggtcgatcgg 2760 agagactcag agataactct atctgtaaag aagttcacag tttttgtgat actgggaaga 2820 acggccatga atcatgacat ctctaattta aatattgacc ttcagtcaaa ctgtcattgt 2880 gatggtgggg agatatctga agaaaagagc gtcctgttga attggtcagc gtaattgtta 2940 tcagctggaa tttcagaagt tgaggaaatc gaatcagttc gccacttcaa acagtattag 3000 ctgctcacgc aaccctgaac taccaaaaag cattgtgaaa tagtttttac tgtggctttt 3060 ttttttacat ttaggcccaa tcccaattct accccttagc ccttttcctt actcctaacc 3120 ctcgttttgc gtgttcatgt gaaggggtaa aggtgtccaa attttttttg gcttgaaggc 3180 atagggctaa tgggaagggc taaatacccc ctgaaacaga tttttaagga ccacacttga 3240 aaccaagggg taagaaaatt tcccaaaaaa aaaaaaaaaa aaaaaaaaaa a 3291 24 559 PRT Danio rerio 24 Met Phe Ala Asn Met Val Ser Lys Leu Thr Ser Leu Gln Gln Glu Leu 1 5 10 15 Leu Ser Ala Leu Leu Asp Ser Gly Val Thr Lys Asp Val Leu Leu Gln 20 25 30 Ala Leu Glu Asp Leu Asp Pro Ser Pro Ser Ala Phe Gly Val Lys Leu 35 40 45 Asp Ser Leu Gln Met Ser Pro Ser Gly Ser Lys Leu Ser Asp Thr Asp 50 55 60 Ser Lys Pro Val Phe His Thr Leu Thr Asn Gly His Ser Lys Gly Lys 65 70 75 80 Leu Ser Gly Asp Glu Gly Ser Glu Asp Gly Asp Asp Tyr Asp Thr Pro 85 90 95 Pro Ile Leu Lys Glu Leu Gln Ser Gln Asn Thr Glu Glu Ala Ala Glu 100 105 110 Gln Arg Ala Glu Ile Glu Arg Met Leu Ala Glu Asp Pro Trp Arg Ala 115 120 125 Ala Arg Met Ile Lys Gly Tyr Met Gln Gln His Asn Ile Pro Gln Arg 130 135 140 Glu Val Val Asp Val Thr Gly Leu Asn Gln Ser His Leu Ser Gln His 145 150 155 160 Leu Asn Lys Gly Thr Pro Met Lys Thr Gln Lys Arg Ala Ala Leu Tyr 165 170 175 Thr Trp Tyr Val Arg Lys Gln Arg Glu Ile Leu Arg Gln Phe Asn Gln 180 185 190 Ala Thr Gln Gly Ser Gly Ala Thr Met Leu Asp Lys Gly Asn Gln Asp 195 200 205 Gln Val Leu Leu Phe Phe Ser Glu Phe Ser Gln Ser Gly Gln Gly Met 210 215 220 Val Gln Pro Gly Asp Asp Ala Ala Ile Glu Pro Ala Cys Lys Lys Leu 225 230 235 240 Arg Arg Asn Arg Phe Lys Trp Gly Pro Ala Ser Gln Gln Ile Leu Tyr 245 250 255 Gln Ala Tyr Glu Arg Gln Lys Asn Pro Ser Lys Glu Glu Arg Glu Ala 260 265 270 Leu Val Glu Glu Cys Asn Arg Ala Glu Cys Leu Gln Arg Gly Val Ser 275 280 285 Pro Ser Lys Ala His Gly Leu Gly Ser Asn Leu Val Thr Glu Val Arg 290 295 300 Val Tyr Asn Trp Phe Ala Asn Arg Arg Lys Glu Glu Ala Phe Arg Gln 305 310 315 320 Lys Leu Ala Met Asp Ala Tyr Ser Gly Pro Ala His Ser Leu Asn Ser 325 330 335 Leu Leu Ser His Ser Ser Pro His His Pro Gln Thr Ser Ser Ser Pro 340 345 350 Pro Ser Lys Met Gln Gly Val Arg Tyr Ser Gln Gln Gly Pro Gly Glu 355 360 365 Val Thr Ser Ser Thr Thr Ile Asn His His Ser Ser Asn Ala Met Ser 370 375 380 Thr Ser Gln Ser Val Leu Gln Gln Val Ser Pro Gly Ala Leu Asp Pro 385 390 395 400 Ser His Ser Leu Leu Ser Pro Asp Ala Lys Met Ile Ser Val Ser Gly 405 410 415 Gly Gly Leu Pro Pro Val Ser Thr Leu Thr Asn Ile His Ala Ser His 420 425 430 His Val His Gln Gln Thr Pro Asn Leu Ile Met Pro Leu Ser Gly Val 435 440 445 Met Ala Ile Ala Gln Ser Leu Asn Thr Ser Gln Ala Gln Thr Val Pro 450 455 460 Val Ile Asn Ser Val Ala Gly Ser Leu Ala Ala Leu Gln Pro Val Gln 465 470 475 480 Phe Ser Gln Gln Leu Asn Ser Gln His Gln Gln Leu Met Gln Gln Ser 485 490 495 Ser Gly His Met Ser Gln Gln Ser Phe Met Ala Ser Val Ser His Ser 500 505 510 His Met Tyr Pro His Lys Gln Glu Pro Pro Gln Tyr Ser His Ser Ser 515 520 525 Arg Phe Pro Pro Ala Met Val Val Thr Asp Ala Asn Ser Leu Ser Thr 530 535 540 Leu Ser Ser Met Ser Ser Ser Lys Gln Cys Pro Leu Gln Ala Trp 545 550 555 25 474 DNA Danio rerio 25 ccacgcgtcc gcccacgcgt ccggaaaaat ggcaaagatc aaggccagag accttcgcgg 60 aaagaaaaag gaggagctgc taaaacagct ggatgatctg aaggtggaac tttcccagct 120 ccgcgttgcc aaggttaccg gtggagctgc ttccaagctc tcaaaaatcc gtgttgtccg 180 caagtccatc gccagagtcc tcacagtcat caaccagaca cagaaggaga acttgaggaa 240 gttttacaag ggtaagaagt acaagccttt ggacctgaga cccaagaaga cccgtgccat 300 ccgtcgtcag ctcacaaaac acgaagagaa cctgatgact aagaagatgc agaggaaatc 360 tcgtctctac tccattcgca agttcgccgt caaggcataa aaaagctgta cttttgtttt 420 tcactgaaaa ataaaatccc ccgctcaccc tggaaaaaaa aaaaaaaaaa aaaa 474 26 123 PRT Danio rerio 26 Met Ala Lys Ile Lys Ala Arg Asp Leu Arg Gly Lys Lys Lys Glu Glu 1 5 10 15 Leu Leu Lys Gln Leu Asp Asp Leu Lys Val Glu Leu Ser Gln Leu Arg 20 25 30 Val Ala Lys Val Thr Gly Gly Ala Ala Ser Lys Leu Ser Lys Ile Arg 35 40 45 Val Val Arg Lys Ser Ile Ala Arg Val Leu Thr Val Ile Asn Gln Thr 50 55 60 Gln Lys Glu Asn Leu Arg Lys Phe Tyr Lys Gly Lys Lys Tyr Lys Pro 65 70 75 80 Leu Asp Leu Arg Pro Lys Lys Thr Arg Ala Ile Arg Arg Gln Leu Thr 85 90 95 Lys His Glu Glu Asn Leu Met Thr Lys Lys Met Gln Arg Lys Ser Arg 100 105 110 Leu Tyr Ser Ile Arg Lys Phe Ala Val Lys Ala 115 120 27 388 DNA Danio rerio 27 cgagggcaac catggtaaac gtaccgaaga cccgcaggac gtactgcaaa aaatgcaaga 60 aacaccagcc gcataaagtg acccagtaca agaagggtaa agactctctg tacgcccagg 120 gaaagaggcg ttacgacaga aagcagagtg gttatggagg acagaccaag cctattttcc 180 gaaaaaaggc taaaaccaca aagaagatcg tgctgaggct ggagtgcgtg gagcctaact 240 gccgctcaaa gaggatgctg gccattaaga gatgcaaaca ctttgagctg ggaggagaca 300 agaagagaaa gggccaggtc atccagtttt aagctgggcc aagtgcttct tttgtggatt 360 accatcgctg tcaataaaag ttgtatac 388 28 106 PRT Danio rerio 28 Met Val Asn Val Pro Lys Thr Arg Arg Thr Tyr Cys Lys Lys Cys Lys 1 5 10 15 Lys His Gln Pro His Lys Val Thr Gln Tyr Lys Lys Gly Lys Asp Ser 20 25 30 Leu Tyr Ala Gln Gly Lys Arg Arg Tyr Asp Arg Lys Gln Ser Gly Tyr 35 40 45 Gly Gly Gln Thr Lys Pro Ile Phe Arg Lys Lys Ala Lys Thr Thr Lys 50 55 60 Lys Ile Val Leu Arg Leu Glu Cys Val Glu Pro Asn Cys Arg Ser Lys 65 70 75 80 Arg Met Leu Ala Ile Lys Arg Cys Lys His Phe Glu Leu Gly Gly Asp 85 90 95 Lys Lys Arg Lys Gly Gln Val Ile Gln Phe 100 105 29 1025 DNA Danio rerio 29 cgtaagcgga tgtacgaata cgcaagaatt caacgcacga aggaagagcg agcgacatct 60 gtacagtaca agatggatac gctaatattg gagccttctc tgtacactgt aaaagcggtt 120 ctgatcatgg ataatgatgg agagagactg tatgcaaagt attatgatga cacgtatccc 180 acagtgaaag agcagaaagc ctttgagaag aacatcttta acaagacgca cagaacagac 240 agtgaaatcg cgttgctgga gggtctaacg gttgtgtaca agagcaatat tgatttatat 300 ttctatgtaa ttggcagctc tcatgaaaat gagctgatgc ttatgtcagt gttgaattgt 360 ctctttgatt ctctgagcca aatgctgagg aaaaatgtgg aaaagagagc cctgctggaa 420 aatatggagg gtcttttctt ggccgttgat gagattgttg atggaggagt gattctggag 480 agcgacccac agcaggtggt tcaccgtgtc gcattgaggg gcgatgacgt gcctctgaca 540 gaacagacag tcactcaggt gctacagtca gcgaaggagc agatcaaatg gtctctcctg 600 cgataggatt cttcagcttt ctgtcctgaa ctgaagcgag gaaaagcaat gagagagcac 660 ctcttttcat cactactgtc cctttctgcc cttttctccg tgagtgtttt catgactgtt 720 aaagtgcaca tccactgaaa accacttctc cagtttctct cactccagat aagaccagca 780 acagcgctat ttctacacat taacacaaga ggaagaagct tgccgggact ttctttttgg 840 gaaaagtacc ggggcgtttg ttttttttta aagcccattc cagtaagcgc ttgaatatat 900 ggctggttta gtgaatgata tcactgttgt actgtgagat tttcatgagg attcatgtac 960 attccagtgg tcgatctttt ctctgttttt ggattgaagt aaagagtttc atcccaaaaa 1020 aaaaa 1025 30 177 PRT Danio rerio 30 Met Asp Thr Leu Ile Leu Glu Pro Ser Leu Tyr Thr Val Lys Ala Val 1 5 10 15 Leu Ile Met Asp Asn Asp Gly Glu Arg Leu Tyr Ala Lys Tyr Tyr Asp 20 25 30 Asp Thr Tyr Pro Thr Val Lys Glu Gln Lys Ala Phe Glu Lys Asn Ile 35 40 45 Phe Asn Lys Thr His Arg Thr Asp Ser Glu Ile Ala Leu Leu Glu Gly 50 55 60 Leu Thr Val Val Tyr Lys Ser Asn Ile Asp Leu Tyr Phe Tyr Val Ile 65 70 75 80 Gly Ser Ser His Glu Asn Glu Leu Met Leu Met Ser Val Leu Asn Cys 85 90 95 Leu Phe Asp Ser Leu Ser Gln Met Leu Arg Lys Asn Val Glu Lys Arg 100 105 110 Ala Leu Leu Glu Asn Met Glu Gly Leu Phe Leu Ala Val Asp Glu Ile 115 120 125 Val Asp Gly Gly Val Ile Leu Glu Ser Asp Pro Gln Gln Val Val His 130 135 140 Arg Val Ala Leu Arg Gly Asp Asp Val Pro Leu Thr Glu Gln Thr Val 145 150 155 160 Thr Gln Val Leu Gln Ser Ala Lys Glu Gln Ile Lys Trp Ser Leu Leu 165 170 175 Arg 31 2144 DNA Danio rerio 31 gactctcccc atgtgtttac ttttgtggaa taaatttatt taaactatgg aaaacattac 60 cgatggggga tgggggagtc tgccggtgaa attacatgac aacatattac agactcttaa 120 agagctggga tttacatata tgactcccgt tcagtctgcc tgtattcctc tttttatgag 180 taataaagat gtggctgccg aggcggtgac tggcagtggg aagactcttg cctttgtgat 240 ccctgcatta gaaattcttc taaagcgaga agagaagtta aaaaagatgc aggtcggtgc 300 gttgatcata acgcccaccc gtgagctggc tatgcaaatc agcgaagtga tgggtcgttt 360 tctgcaggga ttccctcagt ttacacagat ccttttaatt ggtggaagca accctattga 420 ggacgtggag aagttgaaga ctcagggagc caacatcata attgcgactc ccggccgatt 480 ggaggacatg tttaggagga aggctgatgg acttgatttg gccacggctg tgaagtctct 540 agatgtccta gtcctggatg aagctgacag attgttggac atgggctttg aagccagttt 600 aaacaccatt ctggggtatt tacccaagca gcggcgcaca gggcttttct ccgccactca 660 gacgcaggag ctggagaagc tggtgagagc cggtctcaga aaccctgtgc gaatcaccgt 720 gaaggagaaa ggggtcgcgg cctccagtgt gcagaaaacc cctgccaaac tcagcaacta 780 ctacaccatg tgtcgagcag aggagaagtt caacactttg gtggcctttc tcaggcagca 840 caagcatgag aaacagctgg ttttcttcag cacctgtgcc tgtgtggaat atttcggaaa 900 agccctggaa gtcctggtca agaacgtcag catccactgc attcatggaa agatgaagca 960 caaacgcaac aagatttttg ccgatttcag ggctctgaag agtgggattc tggtgtgcac 1020 agacgtcatg gcgagaggca tcgacatacc tgaagtcaac tgggtgcttc agtatgaccc 1080 gccaagcagt gcaagttcct ttgtgcatcg ctgtggacga actgcacgca ttggaaacca 1140 aggaaatgcg cttgtctttc ttctgccaat ggaagagtca tatgtcaatt tcttgtccat 1200 taatcaaaag tgtccccttc agtcgttttc ctcagtgaaa gatgtggttg atgttctgcc 1260 gaaattgaag gcgatggctt taggcgaccg agcgatgttt gagaagggca tgagagcctt 1320 tgtgtcgtat gtgcaggcgt acgccaaaca tgagtgcagt ctgatatttc gcattaaaga 1380 cctagatttc gccgctctgg ctcgtggctt tgctcttctc cgattaccca aaatgcctga 1440 attaagagga aagaccttcc cagacttcaa agcggaggcc attgacacag acaccatccg 1500 ctttaaagat aaaaacaggg agaaacagag acaaaagtgg ctggcggagc agaaagagaa 1560 agaggtgccg ctgaggaaga acttcatcaa gaacaaagcc tggtccaagc agaagatcaa 1620 gaaggatagg aagaagaaga ggcttccaaa agcaaagcta gatcaggatt cggacgctgc 1680 agaggaagac ttaaacgagc ttatgaatga cacccgtctg ctgaagaaac tgaagaaagg 1740 gaaaatcacc gaggaggact tcgacaagca aatgagttca acggacaaac acaaaccagc 1800 aggaatagac agttctgatg gagactgaga aagagacgtc atgtgtgaga cttcatgctg 1860 tcgtttcaat tcgatatgct gtatcagcag ggttacttcc tttctgattg actgtctgtg 1920 tgatgaacga agtccatgcg ttcatcacca atgtgttttt gtactaacaa tattatacac 1980 agtacgtcac ttggaattaa aaatgtataa tgctgtcaaa tcatctctgc aggactgggg 2040 gatttatact ttttcattaa aaataatgtt gagatgaatt gcgctgaaaa taaaccaaat 2100 ggaagatgac tcccaccccc caaaaaaaaa aaaaaaaaaa aaaa 2144 32 593 PRT Danio rerio 32 Met Glu Asn Ile Thr Asp Gly Gly Trp Gly Ser Leu Pro Val Lys Leu 1 5 10 15 His Asp Asn Ile Leu Gln Thr Leu Lys Glu Leu Gly Phe Thr Tyr Met 20 25 30 Thr Pro Val Gln Ser Ala Cys Ile Pro Leu Phe Met Ser Asn Lys Asp 35 40 45 Val Ala Ala Glu Ala Val Thr Gly Ser Gly Lys Thr Leu Ala Phe Val 50 55 60 Ile Pro Ala Leu Glu Ile Leu Leu Lys Arg Glu Glu Lys Leu Lys Lys 65 70 75 80 Met Gln Val Gly Ala Leu Ile Ile Thr Pro Thr Arg Glu Leu Ala Met 85 90 95 Gln Ile Ser Glu Val Met Gly Arg Phe Leu Gln Gly Phe Pro Gln Phe 100 105 110 Thr Gln Ile Leu Leu Ile Gly Gly Ser Asn Pro Ile Glu Asp Val Glu 115 120 125 Lys Leu Lys Thr Gln Gly Ala Asn Ile Ile Ile Ala Thr Pro Gly Arg 130 135 140 Leu Glu Asp Met Phe Arg Arg Lys Ala Asp Gly Leu Asp Leu Ala Thr 145 150 155 160 Ala Val Lys Ser Leu Asp Val Leu Val Leu Asp Glu Ala Asp Arg Leu 165 170 175 Leu Asp Met Gly Phe Glu Ala Ser Leu Asn Thr Ile Leu Gly Tyr Leu 180 185 190 Pro Lys Gln Arg Arg Thr Gly Leu Phe Ser Ala Thr Gln Thr Gln Glu 195 200 205 Leu Glu Lys Leu Val Arg Ala Gly Leu Arg Asn Pro Val Arg Ile Thr 210 215 220 Val Lys Glu Lys Gly Val Ala Ala Ser Ser Val Gln Lys Thr Pro Ala 225 230 235 240 Lys Leu Ser Asn Tyr Tyr Thr Met Cys Arg Ala Glu Glu Lys Phe Asn 245 250 255 Thr Leu Val Ala Phe Leu Arg Gln His Lys His Glu Lys Gln Leu Val 260 265 270 Phe Phe Ser Thr Cys Ala Cys Val Glu Tyr Phe Gly Lys Ala Leu Glu 275 280 285 Val Leu Val Lys Asn Val Ser Ile His Cys Ile His Gly Lys Met Lys 290 295 300 His Lys Arg Asn Lys Ile Phe Ala Asp Phe Arg Ala Leu Lys Ser Gly 305 310 315 320 Ile Leu Val Cys Thr Asp Val Met Ala Arg Gly Ile Asp Ile Pro Glu 325 330 335 Val Asn Trp Val Leu Gln Tyr Asp Pro Pro Ser Ser Ala Ser Ser Phe 340 345 350 Val His Arg Cys Gly Arg Thr Ala Arg Ile Gly Asn Gln Gly Asn Ala 355 360 365 Leu Val Phe Leu Leu Pro Met Glu Glu Ser Tyr Val Asn Phe Leu Ser 370 375 380 Ile Asn Gln Lys Cys Pro Leu Gln Ser Phe Ser Ser Val Lys Asp Val 385 390 395 400 Val Asp Val Leu Pro Lys Leu Lys Ala Met Ala Leu Gly Asp Arg Ala 405 410 415 Met Phe Glu Lys Gly Met Arg Ala Phe Val Ser Tyr Val Gln Ala Tyr 420 425 430 Ala Lys His Glu Cys Ser Leu Ile Phe Arg Ile Lys Asp Leu Asp Phe 435 440 445 Ala Ala Leu Ala Arg Gly Phe Ala Leu Leu Arg Leu Pro Lys Met Pro 450 455 460 Glu Leu Arg Gly Lys Thr Phe Pro Asp Phe Lys Ala Glu Ala Ile Asp 465 470 475 480 Thr Asp Thr Ile Arg Phe Lys Asp Lys Asn Arg Glu Lys Gln Arg Gln 485 490 495 Lys Trp Leu Ala Glu Gln Lys Glu Lys Glu Val Pro Leu Arg Lys Asn 500 505 510 Phe Ile Lys Asn Lys Ala Trp Ser Lys Gln Lys Ile Lys Lys Asp Arg 515 520 525 Lys Lys Lys Arg Leu Pro Lys Ala Lys Leu Asp Gln Asp Ser Asp Ala 530 535 540 Ala Glu Glu Asp Leu Asn Glu Leu Met Asn Asp Thr Arg Leu Leu Lys 545 550 555 560 Lys Leu Lys Lys Gly Lys Ile Thr Glu Glu Asp Phe Asp Lys Gln Met 565 570 575 Ser Ser Thr Asp Lys His Lys Pro Ala Gly Ile Asp Ser Ser Asp Gly 580 585 590 Asp 33 935 DNA Danio rerio 33 tgcacttcat taagtgttgg cgcagagacg cgtctgtaaa ccaataaagc gacgaagagg 60 acgttattca tctcgggctt taaattaacg tcaccaccga gagtgacatg cagtctttgt 120 acccgtaaat gattacaaac taacagagcg aactaaaaat caagatagag tcacaggatg 180 tactttactt taggtaacgt taacgttagc ggtatctatt gtcattctcc ccaagctgca 240 gtcgtgctcg tcgtaatggg tttccatggg cggagtgtgt aaaaaccaag ggtctccttc 300 agtctcggct ccagcatcag cagcaggtca tcatgaggat gagactgaag ctgaagacag 360 tgtttgtcct gtacttcatg gtgtctctgt ttgggctcct gtacgcgctg atgcagctcg 420 gtcagcgctg tgactgtcga gatcatgagc aatctaaaga tcagcagatc tctcagctga 480 aaggggaact gcagaagctg caggagcaca tcaaaacatc agagctgtcc aagaagacgg 540 atgtgcccag aatatacgtc ataacaccca catatgctag actggtgcag aaagctgagc 600 tcactcgtct atctcacaca ttcctccacg ttccccagct acactggatc gtggtggaag 660 acgctcctca gcaaactcag ctcgtctccg acttcctgtc tgcttccggc ttgacctaca 720 ctcatctaaa caagctgacg cccaaggaga ggaagctgca ggagggagac cccaactggc 780 tgaagccccg gggggctgag cagaggaacg agggtctgcg ctggctcaga tggatgggtt 840 ctacagttca cgggaaagaa gcagctgccc ttgaagaggc cgtggtttac tttgctgatg 900 atgacaacac gtacagccta cagctttttg aagag 935 34 201 PRT Danio rerio 34 Met Arg Met Arg Leu Lys Leu Lys Thr Val Phe Val Leu Tyr Phe Met 1 5 10 15 Val Ser Leu Phe Gly Leu Leu Tyr Ala Leu Met Gln Leu Gly Gln Arg 20 25 30 Cys Asp Cys Arg Asp His Glu Gln Ser Lys Asp Gln Gln Ile Ser Gln 35 40 45 Leu Lys Gly Glu Leu Gln Lys Leu Gln Glu His Ile Lys Thr Ser Glu 50 55 60 Leu Ser Lys Lys Thr Asp Val Pro Arg Ile Tyr Val Ile Thr Pro Thr 65 70 75 80 Tyr Ala Arg Leu Val Gln Lys Ala Glu Leu Thr Arg Leu Ser His Thr 85 90 95 Phe Leu His Val Pro Gln Leu His Trp Ile Val Val Glu Asp Ala Pro 100 105 110 Gln Gln Thr Gln Leu Val Ser Asp Phe Leu Ser Ala Ser Gly Leu Thr 115 120 125 Tyr Thr His Leu Asn Lys Leu Thr Pro Lys Glu Arg Lys Leu Gln Glu 130 135 140 Gly Asp Pro Asn Trp Leu Lys Pro Arg Gly Ala Glu Gln Arg Asn Glu 145 150 155 160 Gly Leu Arg Trp Leu Arg Trp Met Gly Ser Thr Val His Gly Lys Glu 165 170 175 Ala Ala Ala Leu Glu Glu Ala Val Val Tyr Phe Ala Asp Asp Asp Asn 180 185 190 Thr Tyr Ser Leu Gln Leu Phe Glu Glu 195 200 35 874 DNA Danio rerio 35 attgtattca ctctcgctaa acttaacaat gattggtcag tgtaataatg agatcgcaga 60 gtgtgattaa tgtaatcgct gtgtttccgt cagtaatgcc cggatcagcg ttgttatcgc 120 tgcagcagga gctccggcgg ggattcgagg ctctaaaggc cgccaaagat acgtttgatg 180 gcggtcatgc ggagtgtcag gagctcgtct cctctctggg gaacctggcg ctccagctga 240 aggctctcaa acaggtccag atatccgaaa ctcctctctc cggattccca tgtttacagc 300 agcgactcca ttacaagctc tcgctggcgg tggacgcact tctggctaaa ctagccgaga 360 aaatggctgc tcttcagaag gtgcgggatg ccttcgctca gcaggtgttt actgtcgtgc 420 agctctatga gaaaaacaca gacgctctgg acatcagcgt gtgcgtctcc agatcagccg 480 tctctccttc agtctcagac atgctggagt ggcttcagga cgccgaccga tattaccgcc 540 tgcaattaat gcaaagccgg ctcctccttc agacgctgac ccccactggc cttacatcaa 600 tggaaacagc accaaataga tggagatctg tgcgttctgc ccgtcaggag gagagaattg 660 cagatgcatt gtgtcaagtg tcgatcttct tggagactga atgaatgaag gatccttaaa 720 aagagacaga ctgaactctg tctcctggat taatgtagtt ttatggtgcg atttagcatt 780 ataaggactg tgcatcagca tcactgtaag agcaatgctc tactgtaata aatgcaattc 840 atgcttttta tattcaaaaa aaaaaaaaaa aaaa 874 36 218 PRT Danio rerio 36 Met Arg Ser Gln Ser Val Ile Asn Val Ile Ala Val Phe Pro Ser Val 1 5 10 15 Met Pro Gly Ser Ala Leu Leu Ser Leu Gln Gln Glu Leu Arg Arg Gly 20 25 30 Phe Glu Ala Leu Lys Ala Ala Lys Asp Thr Phe Asp Gly Gly His Ala 35 40 45 Glu Cys Gln Glu Leu Val Ser Ser Leu Gly Asn Leu Ala Leu Gln Leu 50 55 60 Lys Ala Leu Lys Gln Val Gln Ile Ser Glu Thr Pro Leu Ser Gly Phe 65 70 75 80 Pro Cys Leu Gln Gln Arg Leu His Tyr Lys Leu Ser Leu Ala Val Asp 85 90 95 Ala Leu Leu Ala Lys Leu Ala Glu Lys Met Ala Ala Leu Gln Lys Val 100 105 110 Arg Asp Ala Phe Ala Gln Gln Val Phe Thr Val Val Gln Leu Tyr Glu 115 120 125 Lys Asn Thr Asp Ala Leu Asp Ile Ser Val Cys Val Ser Arg Ser Ala 130 135 140 Val Ser Pro Ser Val Ser Asp Met Leu Glu Trp Leu Gln Asp Ala Asp 145 150 155 160 Arg Tyr Tyr Arg Leu Gln Leu Met Gln Ser Arg Leu Leu Leu Gln Thr 165 170 175 Leu Thr Pro Thr Gly Leu Thr Ser Met Glu Thr Ala Pro Asn Arg Trp 180 185 190 Arg Ser Val Arg Ser Ala Arg Gln Glu Glu Arg Ile Ala Asp Ala Leu 195 200 205 Cys Gln Val Ser Ile Phe Leu Glu Thr Glu 210 215 37 1050 DNA Danio rerio 37 caacccacag aaggcggggg gtggttttca agatggcgcc gctttcttcg tccacacgtg 60 aggtgttcag tgaagcagtt cgggctgttt tggaaacttg gccggtgctg caaatagcgg 120 tggacaacgg cttcggtgga gcgttcagtc agcagaaggc ggagtggatg gtggacgcgc 180 tgcagcaata cttcaccgac aacgatgatc tccagccgga tgaggtggag gacttcatct 240 ctgagctcat gaacaatgag ttcgacacag tggtggatga tggcagtctg cctcaggtgg 300 cgcagcaggt gtgtgggatg ttccagcagt gtgagcaggg cagattgttg gaggtcagag 360 agcagatcct taaactgaac cagaagaaga caagcagcgg aagagctaaa gccactcctg 420 cacaaacacc tcaggatgat gatgatgatg atgatgagga agaggctatg gattgtgaag 480 gaggatcagc gggggcttca gtcagcagta cagcagtgaa tgatcttcat catgaggagg 540 aggaagatga cggatggaca gtcgtgcgca gaaaaaagtg accaatctct cacttcctgc 600 attcatgaca tttaaagaca tttgaagggc ttgtgtcact tctactttgg acttttctgc 660 tttttcagcc actgatataa aatggtagga tctcctcagg caaccttatg aagtcctgtc 720 tctggagcac agattggatg ttgtccaggg aaattcaggt gcatatacag tcatgcattt 780 aaatgtttat ggtcgagtca tttatcacag tttctctgtt tgacttcatc tgaatgcatt 840 tatttataag cggtaggtca atgcctgctc ctctgtcgcc cccttgtgga actatagaat 900 ccagcagcag tggtgaatat gctgtttaac aggcacacta agagaaaaaa ctgtagttac 960 agagcttgtt aaatgattaa catcataatt tcttccacat tttaaataaa aatattgtct 1020 gttttcaaaa aaaaaaaaaa aaaaaaaaaa 1050 38 182 PRT Danio rerio 38 Met Ala Pro Leu Ser Ser Ser Thr Arg Glu Val Phe Ser Glu Ala Val 1 5 10 15 Arg Ala Val Leu Glu Thr Trp Pro Val Leu Gln Ile Ala Val Asp Asn 20 25 30 Gly Phe Gly Gly Ala Phe Ser Gln Gln Lys Ala Glu Trp Met Val Asp 35 40 45 Ala Leu Gln Gln Tyr Phe Thr Asp Asn Asp Asp Leu Gln Pro Asp Glu 50 55 60 Val Glu Asp Phe Ile Ser Glu Leu Met Asn Asn Glu Phe Asp Thr Val 65 70 75 80 Val Asp Asp Gly Ser Leu Pro Gln Val Ala Gln Gln Val Cys Gly Met 85 90 95 Phe Gln Gln Cys Glu Gln Gly Arg Leu Leu Glu Val Arg Glu Gln Ile 100 105 110 Leu Lys Leu Asn Gln Lys Lys Thr Ser Ser Gly Arg Ala Lys Ala Thr 115 120 125 Pro Ala Gln Thr Pro Gln Asp Asp Asp Asp Asp Asp Asp Glu Glu Glu 130 135 140 Ala Met Asp Cys Glu Gly Gly Ser Ala Gly Ala Ser Val Ser Ser Thr 145 150 155 160 Ala Val Asn Asp Leu His His Glu Glu Glu Glu Asp Asp Gly Trp Thr 165 170 175 Val Val Arg Arg Lys Lys 180 39 2999 DNA Danio rerio 39 aaacttaatg cacaaaatct ccatctgata aaacaaataa agtgttaaat ccgagagcag 60 tctatcgggt tacatttttt gcagtctaat aataaacaat gtcgctttca gcagcgcgtt 120 cgtttctttc ggaggcggca tacggcgagc aggagctgga cgccaattca gcgctgatgg 180 agctcgataa aggtttacgg tcttgtaaac tgggcgagca gtgtgaagcc gtggtccttt 240 ttcccaaact cttccagaaa tacccgttcc ccatcctcat caactctgcc ttcctgaagc 300 tggcagatat cttcagactt ggcaacaatt tcctgcggct gtgtgtgctg aaggtcacgc 360 agctgagtga aaaacacttg gagaagatcc tgaatgtgga tgagtttgtc aaaagagtgt 420 tttctgtcat ccacagcaat gaccctgtgg cccgagcaat caccctcaga atgctgggaa 480 gtttagcttc catcattcca gagagaaaga acgctcacca cagcatccgc caaagcctgg 540 actctcatga taatgttgaa gtagaagctg ctatttttgc agccgctagc ttctcatcac 600 actccaaaga ttttgcagca ggaatttgca acaagattag tgagatgatt caaggactgg 660 acactccagt agagctaaag ctgaagttga tcccgatgct gcagcacatg caccatgatg 720 ccagcctggc gtcgtgcagc agagagttac tgcaggagct ggtgtcctcg tacccatcca 780 catccatgct gatcgtcaca ctgcacacct ttacccagct ggccacatct tcactcgttg 840 acataccgga gcagatttgt cttcttctgc agtacttgaa ggaagacccc agaaaagctg 900 taaaaagact ctccattcag gatctcaagc tcctggccaa aaaggctcct catttgtgga 960 ccaggaaaaa catacaggtg ctgtgcgagt gtgcactcca caccccatat aacagcctga 1020 agctgggaat gctgtctgtc ctctccacat tgtctggcac catcgctatt aaacagtact 1080 tcagccctaa tgcaggtgat tcatctcctg cacctcacca cactgacctg gtaaagctgg 1140 cacaggagtg ctgttatcac agtgacctgg cagttgcagc acatggaatc acagtgttga 1200 ctagcattgc tgctttctgc cctgagaaag aggtcattca gctggaacag gaaacagtga 1260 tggggatgga gtctctcatt ctgctctgca gtcaagatga cagtaagact gcccaggcca 1320 cactaaaaac ggcgctcacg tctctggtgc agatgttgaa gacttgtcct catctcagtc 1380 agagctcagt ggagctcctg cttaggcagc tgcattgcgc atgtgaccct gctcgggtgc 1440 tcatgtgtca ggctctggct gctattgcca cacaacagcc agtgctggta gagggaatgc 1500 tgggagacct gctggagctc ttcagggtgg caagtcacag gacttcagag aaacagcagg 1560 agcttctggt gtctttggca actgtgttat ttgtggccag tcaggcgtct ctctcagctg 1620 aggtgaaggc tgtgatcaga caacagttgg agaatgtggc caatggctgg accgtgtacc 1680 aaattgccag acaagcgtcc cgcatgggct gccatgattt ttccagggag ttgtaccaga 1740 gtcttcgcac ccgtgtggcc tccgaacact tttacttctg gcttaacagt ctgatggagt 1800 tttcccaagc ggagcagtgc cttagtggcc tggaagatgg agactacagt gcagccatga 1860 gcgccatctc tgaagcactg aaatcctacc agaagggaat cgcatctctc acggctgcca 1920 gtacgccgct cagtccgctg acattccaat gtgagtttgt gaagctgcgt atagacactc 1980 ttcaggctct gtctcagctc atttgcacct gcaacagtct gaagaccagt cctcctcctg 2040 ccatcgcaac cacgatcgct ctgtcctcgg gcagtgactt gcagcgctgc ggacgaatct 2100 ccacacagat gaagttttcc atggatgagt ttagaagtct cgcagctcgc tatgctgacc 2160 tgtaccagtc ttcgtttgat gcggactatg ccacccttcg caatgttgaa ctacaacagc 2220 agagttgctt actcgtatct tatgttatcg aagctttaat aatcgatcca caaacagcca 2280 gtttccagga gtttggcact catggatcaa ttctggcaga gagcgagtat gagctgagaa 2340 tgatggctgt gttcaatcat gtcctggagg aagtggaaaa tctaagcagg aaacaccctc 2400 ctgtttctta cctgcacacc ggctgtctgt gtgatactgt catagccatc ctgaaaatac 2460 cactgtcgtt ccagagatat ttctttcaga agcttcagtc caccagcatt aaacttgctc 2520 tctctccatc tccccgcaca cccaatgagc cgatcccagt gcagaacaac cagcagctga 2580 ccttaaaagt ggagggtgtt attcagcacg gctccagtcc gggactcttc cggaagattc 2640 aggcagtctg tcttaaagtc agctcgacgt tacagacaaa accaggatca gacttcaaga 2700 ttcctctgga atccaaaacc aacgagattg aacagaaagt agagcctcat aacgattact 2760 tcagcacaca gttcctgctc aacttctcca tcctgggcac acaccaggta tctgttgaag 2820 cctcagtggt ggacaccagt gggattgaat ggaagaccgg ccctaaaaac actgtttcgg 2880 tgaagtctct ggaagaccca tattcccagc agcttcggca tcaattacag cagcagcagc 2940 agaacgtccc gcagcctgca gcccagagga acatcagcac acgcttccaa taaaaacac 2999 40 965 PRT Danio rerio 40 Met Ser Leu Ser Ala Ala Arg Ser Phe Leu Ser Glu Ala Ala Tyr Gly 1 5 10 15 Glu Gln Glu Leu Asp Ala Asn Ser Ala Leu Met Glu Leu Asp Lys Gly 20 25 30 Leu Arg Ser Cys Lys Leu Gly Glu Gln Cys Glu Ala Val Val Leu Phe 35 40 45 Pro Lys Leu Phe Gln Lys Tyr Pro Phe Pro Ile Leu Ile Asn Ser Ala 50 55 60 Phe Leu Lys Leu Ala Asp Ile Phe Arg Leu Gly Asn Asn Phe Leu Arg 65 70 75 80 Leu Cys Val Leu Lys Val Thr Gln Leu Ser Glu Lys His Leu Glu Lys 85 90 95 Ile Leu Asn Val Asp Glu Phe Val Lys Arg Val Phe Ser Val Ile His 100 105 110 Ser Asn Asp Pro Val Ala Arg Ala Ile Thr Leu Arg Met Leu Gly Ser 115 120 125 Leu Ala Ser Ile Ile Pro Glu Arg Lys Asn Ala His His Ser Ile Arg 130 135 140 Gln Ser Leu Asp Ser His Asp Asn Val Glu Val Glu Ala Ala Ile Phe 145 150 155 160 Ala Ala Ala Ser Phe Ser Ser His Ser Lys Asp Phe Ala Ala Gly Ile 165 170 175 Cys Asn Lys Ile Ser Glu Met Ile Gln Gly Leu Asp Thr Pro Val Glu 180 185 190 Leu Lys Leu Lys Leu Ile Pro Met Leu Gln His Met His His Asp Ala 195 200 205 Ser Leu Ala Ser Cys Ser Arg Glu Leu Leu Gln Glu Leu Val Ser Ser 210 215 220 Tyr Pro Ser Thr Ser Met Leu Ile Val Thr Leu His Thr Phe Thr Gln 225 230 235 240 Leu Ala Thr Ser Ser Leu Val Asp Ile Pro Glu Gln Ile Cys Leu Leu 245 250 255 Leu Gln Tyr Leu Lys Glu Asp Pro Arg Lys Ala Val Lys Arg Leu Ser 260 265 270 Ile Gln Asp Leu Lys Leu Leu Ala Lys Lys Ala Pro His Leu Trp Thr 275 280 285 Arg Lys Asn Ile Gln Val Leu Cys Glu Cys Ala Leu His Thr Pro Tyr 290 295 300 Asn Ser Leu Lys Leu Gly Met Leu Ser Val Leu Ser Thr Leu Ser Gly 305 310 315 320 Thr Ile Ala Ile Lys Gln Tyr Phe Ser Pro Asn Ala Gly Asp Ser Ser 325 330 335 Pro Ala Pro His His Thr Asp Leu Val Lys Leu Ala Gln Glu Cys Cys 340 345 350 Tyr His Ser Asp Leu Ala Val Ala Ala His Gly Ile Thr Val Leu Thr 355 360 365 Ser Ile Ala Ala Phe Cys Pro Glu Lys Glu Val Ile Gln Leu Glu Gln 370 375 380 Glu Thr Val Met Gly Met Glu Ser Leu Ile Leu Leu Cys Ser Gln Asp 385 390 395 400 Asp Ser Lys Thr Ala Gln Ala Thr Leu Lys Thr Ala Leu Thr Ser Leu 405 410 415 Val Gln Met Leu Lys Thr Cys Pro His Leu Ser Gln Ser Ser Val Glu 420 425 430 Leu Leu Leu Arg Gln Leu His Cys Ala Cys Asp Pro Ala Arg Val Leu 435 440 445 Met Cys Gln Ala Leu Ala Ala Ile Ala Thr Gln Gln Pro Val Leu Val 450 455 460 Glu Gly Met Leu Gly Asp Leu Leu Glu Leu Phe Arg Val Ala Ser His 465 470 475 480 Arg Thr Ser Glu Lys Gln Gln Glu Leu Leu Val Ser Leu Ala Thr Val 485 490 495 Leu Phe Val Ala Ser Gln Ala Ser Leu Ser Ala Glu Val Lys Ala Val 500 505 510 Ile Arg Gln Gln Leu Glu Asn Val Ala Asn Gly Trp Thr Val Tyr Gln 515 520 525 Ile Ala Arg Gln Ala Ser Arg Met Gly Cys His Asp Phe Ser Arg Glu 530 535 540 Leu Tyr Gln Ser Leu Arg Thr Arg Val Ala Ser Glu His Phe Tyr Phe 545 550 555 560 Trp Leu Asn Ser Leu Met Glu Phe Ser Gln Ala Glu Gln Cys Leu Ser 565 570 575 Gly Leu Glu Asp Gly Asp Tyr Ser Ala Ala Met Ser Ala Ile Ser Glu 580 585 590 Ala Leu Lys Ser Tyr Gln Lys Gly Ile Ala Ser Leu Thr Ala Ala Ser 595 600 605 Thr Pro Leu Ser Pro Leu Thr Phe Gln Cys Glu Phe Val Lys Leu Arg 610 615 620 Ile Asp Thr Leu Gln Ala Leu Ser Gln Leu Ile Cys Thr Cys Asn Ser 625 630 635 640 Leu Lys Thr Ser Pro Pro Pro Ala Ile Ala Thr Thr Ile Ala Leu Ser 645 650 655 Ser Gly Ser Asp Leu Gln Arg Cys Gly Arg Ile Ser Thr Gln Met Lys 660 665 670 Phe Ser Met Asp Glu Phe Arg Ser Leu Ala Ala Arg Tyr Ala Asp Leu 675 680 685 Tyr Gln Ser Ser Phe Asp Ala Asp Tyr Ala Thr Leu Arg Asn Val Glu 690 695 700 Leu Gln Gln Gln Ser Cys Leu Leu Val Ser Tyr Val Ile Glu Ala Leu 705 710 715 720 Ile Ile Asp Pro Gln Thr Ala Ser Phe Gln Glu Phe Gly Thr His Gly 725 730 735 Ser Ile Leu Ala Glu Ser Glu Tyr Glu Leu Arg Met Met Ala Val Phe 740 745 750 Asn His Val Leu Glu Glu Val Glu Asn Leu Ser Arg Lys His Pro Pro 755 760 765 Val Ser Tyr Leu His Thr Gly Cys Leu Cys Asp Thr Val Ile Ala Ile 770 775 780 Leu Lys Ile Pro Leu Ser Phe Gln Arg Tyr Phe Phe Gln Lys Leu Gln 785 790 795 800 Ser Thr Ser Ile Lys Leu Ala Leu Ser Pro Ser Pro Arg Thr Pro Asn 805 810 815 Glu Pro Ile Pro Val Gln Asn Asn Gln Gln Leu Thr Leu Lys Val Glu 820 825 830 Gly Val Ile Gln His Gly Ser Ser Pro Gly Leu Phe Arg Lys Ile Gln 835 840 845 Ala Val Cys Leu Lys Val Ser Ser Thr Leu Gln Thr Lys Pro Gly Ser 850 855 860 Asp Phe Lys Ile Pro Leu Glu Ser Lys Thr Asn Glu Ile Glu Gln Lys 865 870 875 880 Val Glu Pro His Asn Asp Tyr Phe Ser Thr Gln Phe Leu Leu Asn Phe 885 890 895 Ser Ile Leu Gly Thr His Gln Val Ser Val Glu Ala Ser Val Val Asp 900 905 910 Thr Ser Gly Ile Glu Trp Lys Thr Gly Pro Lys Asn Thr Val Ser Val 915 920 925 Lys Ser Leu Glu Asp Pro Tyr Ser Gln Gln Leu Arg His Gln Leu Gln 930 935 940 Gln Gln Gln Gln Asn Val Pro Gln Pro Ala Ala Gln Arg Asn Ile Ser 945 950 955 960 Thr Arg Phe Gln Glx 965 41 2348 DNA Danio rerio 41 gttcatcccg agacggacgg atggcgaagt gagtgggttg cgctttgaag agaaagcagc 60 gactcacaaa aattcttcaa cgttaaaacg tcggttgcct tgatgttcgt tcaaataagc 120 gggaaaacgc atctttaaga agaaaaatcg cccctcagag cgaggtgaat cgaagagagc 180 gtcgaggaca tcgactcctc cgtgtccttt caggctaagc tgctagtcag ctattcgggg 240 aattgctcaa gagaaaaaac agcctcaacc tcaggggaaa tagtcttgtt tttttttttt 300 ttacttaaat gtcgaggcag caaaacttat attgaaaccg atgtccgtta tcatcttcat 360 ttcgtcgttg tgagttaagg acacagctag taaggaagag ccgtttttag gctagctagc 420 attgtttcta ctggactgaa gaagagagct gcctgaaaac ctcgcccgtt taccagtttc 480 agtgtacttc tcggatttta ggacatggcc agcagcagtg gctctaaagc cgagtttata 540 gtcggtggga aatacaagct cgttcgtaaa atcggatctg gatccttcgg tgacatttat 600 ttggcaatca acatcacaaa tggagaggag gtagctgtga aattggaatc acagaaagcc 660 agacatcctc aactcctata tgaaagcaaa ttgtacaaaa ttctccaggg aggagtcggg 720 atcccacaca tcaggtggta tggccaagaa aaggactata atgtcctagt catggacctg 780 ctcggcccaa gtctggaaga tctctttaac ttctgttccc gcagattcac aatgaaaact 840 gtcctaatgc ttgcagatca gatgatcagc agaatcgagt atgtgcacac aaaaaacttc 900 atccacagag atatcaagcc agacaacttt ttaatgggta ttggccgtca ctgtaataag 960 ttgttcctca tcgattttgg tctggccaag aaatacagag acaacaggac acgacagcac 1020 ataccctaca gagaagacaa aaacctcaca ggcacagctc gctatgccag catcaacgca 1080 cacttgggca tcgagcagag tcgtcgagat gacatggagt cgctaggata tgtcctgatg 1140 tacttcaaca gaaccagcct gccttggcag ggactgaagg ctgccacaaa gaaacagaaa 1200 tatgagaaga ttagtgagaa aaagatgtcg acccctgttg aggtgttgtg taagggtttc 1260 ccagcagagt ttgccatgta cctgaactac tgtcgtgggt tgcggtttga agaggcgcct 1320 gactacatgt acctgcgtca gcttttccgc attcttttca ggactctgaa ccaccagtat 1380 gactacacat ttgactggac catgctgaag cagaaagcag cacagcaagc ggcctcctca 1440 gggggacagg ggcaacaggc acaaaccccc acaggtttct aagcattaag acaatgtaat 1500 gaagcagagc agagtggtcg tagcaggatt tgccttccaa tcacaccccg gcatctagga 1560 tcaaattcac cagaacctgc aggcactgtg ggcaaccttt ttcctgttaa agaactttcg 1620 ttccatatac tatatatgta attggtatat atagataaga tgctccgcat atatctatat 1680 attggtatgt atatactata tccacacaca gaagcaggct tggtttggag ttggaagctg 1740 tagccttttc tattgtcctg tcccttaaga cttttcgccc cctccttctt tgaaacggaa 1800 aactgtagaa aatggggatg taattgcgca catgccaatg tttttcccga gttttctcca 1860 tcaggaaatc tgtttggggt ccttcctgaa ttcttggttg cccatcagag cagttttggc 1920 ccccttttcc cctttgtcat cgtcacagag tgtggcacga acccagacac tcttcagaat 1980 ggagacatcc gagacacatt ttcatccttt aattttttta ttcttcttcc tttcttcaaa 2040 caaagtgaaa catctgagat taaacctgag tttgtttttc gttctttctt tttcttttgg 2100 ttttgttcca agtgtatcca aataattaaa aaggcaaaca ttgggttttg ttcttttttt 2160 ctaaaaaggt aattacgatt ctgcctgctg atatgtgcta gagagcttct aatgtgaaga 2220 ttaaaaacat tctagtgact actaggtttc tctttagagc gctgcattaa ctatattgta 2280 cttggtattc tgaagtctaa aaaaaataaa ataaataaaa gagtaaactc ccaccaaaaa 2340 aaaaaaaa 2348 42 325 PRT Danio rerio 42 Met Ala Ser Ser Ser Gly Ser Lys Ala Glu Phe Ile Val Gly Gly Lys 1 5 10 15 Tyr Lys Leu Val Arg Lys Ile Gly Ser Gly Ser Phe Gly Asp Ile Tyr 20 25 30 Leu Ala Ile Asn Ile Thr Asn Gly Glu Glu Val Ala Val Lys Leu Glu 35 40 45 Ser Gln Lys Ala Arg His Pro Gln Leu Leu Tyr Glu Ser Lys Leu Tyr 50 55 60 Lys Ile Leu Gln Gly Gly Val Gly Ile Pro His Ile Arg Trp Tyr Gly 65 70 75 80 Gln Glu Lys Asp Tyr Asn Val Leu Val Met Asp Leu Leu Gly Pro Ser 85 90 95 Leu Glu Asp Leu Phe Asn Phe Cys Ser Arg Arg Phe Thr Met Lys Thr 100 105 110 Val Leu Met Leu Ala Asp Gln Met Ile Ser Arg Ile Glu Tyr Val His 115 120 125 Thr Lys Asn Phe Ile His Arg Asp Ile Lys Pro Asp Asn Phe Leu Met 130 135 140 Gly Ile Gly Arg His Cys Asn Lys Leu Phe Leu Ile Asp Phe Gly Leu 145 150 155 160 Ala Lys Lys Tyr Arg Asp Asn Arg Thr Arg Gln His Ile Pro Tyr Arg 165 170 175 Glu Asp Lys Asn Leu Thr Gly Thr Ala Arg Tyr Ala Ser Ile Asn Ala 180 185 190 His Leu Gly Ile Glu Gln Ser Arg Arg Asp Asp Met Glu Ser Leu Gly 195 200 205 Tyr Val Leu Met Tyr Phe Asn Arg Thr Ser Leu Pro Trp Gln Gly Leu 210 215 220 Lys Ala Ala Thr Lys Lys Gln Lys Tyr Glu Lys Ile Ser Glu Lys Lys 225 230 235 240 Met Ser Thr Pro Val Glu Val Leu Cys Lys Gly Phe Pro Ala Glu Phe 245 250 255 Ala Met Tyr Leu Asn Tyr Cys Arg Gly Leu Arg Phe Glu Glu Ala Pro 260 265 270 Asp Tyr Met Tyr Leu Arg Gln Leu Phe Arg Ile Leu Phe Arg Thr Leu 275 280 285 Asn His Gln Tyr Asp Tyr Thr Phe Asp Trp Thr Met Leu Lys Gln Lys 290 295 300 Ala Ala Gln Gln Ala Ala Ser Ser Gly Gly Gln Gly Gln Gln Ala Gln 305 310 315 320 Thr Pro Thr Gly Phe 325 43 1400 DNA Danio rerio 43 gccacttcaa agacccaaaa cgacctcact gaccatcaaa agactgcaag tttctataag 60 tgaacttaga tgaccggcca gcactcatga cattcacttt ccagaggacg gtgtaagaga 120 aagagctcag agactttatt tcaataactg cgtgtggatt attaccttga tttgacatgt 180 tttcctgcgg gctcctgagc gtagttttgg ccctcgcggt cggcctggtg agctgcacgg 240 gaaaccttgc tggatttcaa gagaccctca ggaataaaat gcgagcagct ggcagaaacg 300 gcggaggtcg ggcaggtcac ggccgacatc tgaccagata tccgctgtat atgatgcacc 360 tctaccggac acttctgact ggagacgaaa aacacttcag ccatgagaat ccaactcttt 420 acgagtctga ctccgtcttg agcctcgtcg ctaaaagttg tcatcaggtt ggtgacaaat 480 tcgcagtgac atttgacatg tcctccatat cagcaagcga tgacgtccag cgagctgaac 540 ttcgtattcg gcttccgcat ctcaggtctg agttggaggt ggatatttat cacgcatcta 600 caccggagtg tgagagaagc ccctgcgaag aagtccgggt ccacctggga accttgaacg 660 ccaacccgat taactcaacc ttccgctcct cttggaggat cttcaacatc actgcgctcc 720 tcaagtactg gttgcaccag agtgagcgtg tgccgtttga ggaacccaca cagatgccgc 780 caatggctga gggacacaag agcgttcatc atcctacagc gaaccgagtg atgatggtgg 840 tttactccaa gcagaaccgg gcaaagacgt ccactctcat ccggactgcc gagcactcca 900 agtatgtagc tctggatcgg gctggtggtg gaagtgaacc tgtgcctcgg cgccacagaa 960 ggaaccacag aactgatgat agggtgcgcg atgcagcagc agggatgatt cctggtgttt 1020 cccatgaagg tggagagaag aaacctctct gcaagaaggt ggatatgtgg gtggactttg 1080 atcagattgg ttggagcgac tggattgttt atccaaagcg ttacaacgct tatcggtgtg 1140 aaggcagttg tcccacacca gtagatgaaa ccttcactcc aacaaaccat gcatacatgc 1200 agagtctttt gaagctgcac cacccagatc gcgtcccatg cttgtcctgc gtacccaccc 1260 gtctggctcc actctccatg ctctactatg agaatggcaa gatggtcatg agacaccatg 1320 aaggcatggt tgttgcagaa tgcggctgcc actgattctt caaaccccaa aggaactcaa 1380 ctctagcact ttggatatgc 1400 44 392 PRT Danio rerio 44 Met Phe Ser Cys Gly Leu Leu Ser Val Val Leu Ala Leu Ala Val Gly 1 5 10 15 Leu Val Ser Cys Thr Gly Asn Leu Ala Gly Phe Gln Glu Thr Leu Arg 20 25 30 Asn Lys Met Arg Ala Ala Gly Arg Asn Gly Gly Gly Arg Ala Gly His 35 40 45 Gly Arg His Leu Thr Arg Tyr Pro Leu Tyr Met Met His Leu Tyr Arg 50 55 60 Thr Leu Leu Thr Gly Asp Glu Lys His Phe Ser His Glu Asn Pro Thr 65 70 75 80 Leu Tyr Glu Ser Asp Ser Val Leu Ser Leu Val Ala Lys Ser Cys His 85 90 95 Gln Val Gly Asp Lys Phe Ala Val Thr Phe Asp Met Ser Ser Ile Ser 100 105 110 Ala Ser Asp Asp Val Gln Arg Ala Glu Leu Arg Ile Arg Leu Pro His 115 120 125 Leu Arg Ser Glu Leu Glu Val Asp Ile Tyr His Ala Ser Thr Pro Glu 130 135 140 Cys Glu Arg Ser Pro Cys Glu Glu Val Arg Val His Leu Gly Thr Leu 145 150 155 160 Asn Ala Asn Pro Ile Asn Ser Thr Phe Arg Ser Ser Trp Arg Ile Phe 165 170 175 Asn Ile Thr Ala Leu Leu Lys Tyr Trp Leu His Gln Ser Glu Arg Val 180 185 190 Pro Phe Glu Glu Pro Thr Gln Met Pro Pro Met Ala Glu Gly His Lys 195 200 205 Ser Val His His Pro Thr Ala Asn Arg Val Met Met Val Val Tyr Ser 210 215 220 Lys Gln Asn Arg Ala Lys Thr Ser Thr Leu Ile Arg Thr Ala Glu His 225 230 235 240 Ser Lys Tyr Val Ala Leu Asp Arg Ala Gly Gly Gly Ser Glu Pro Val 245 250 255 Pro Arg Arg His Arg Arg Asn His Arg Thr Asp Asp Arg Val Arg Asp 260 265 270 Ala Ala Ala Gly Met Ile Pro Gly Val Ser His Glu Gly Gly Glu Lys 275 280 285 Lys Pro Leu Cys Lys Lys Val Asp Met Trp Val Asp Phe Asp Gln Ile 290 295 300 Gly Trp Ser Asp Trp Ile Val Tyr Pro Lys Arg Tyr Asn Ala Tyr Arg 305 310 315 320 Cys Glu Gly Ser Cys Pro Thr Pro Val Asp Glu Thr Phe Thr Pro Thr 325 330 335 Asn His Ala Tyr Met Gln Ser Leu Leu Lys Leu His His Pro Asp Arg 340 345 350 Val Pro Cys Leu Ser Cys Val Pro Thr Arg Leu Ala Pro Leu Ser Met 355 360 365 Leu Tyr Tyr Glu Asn Gly Lys Met Val Met Arg His His Glu Gly Met 370 375 380 Val Val Ala Glu Cys Gly Cys His 385 390 45 3045 DNA Danio rerio 45 caacacattt tcctccatcc caaaagtggg ggatggtgaa cgctatttgg tttgttctac 60 cttttcctgt aagaaactgg aaggaagcaa atttttgccg ttgaacttac aaaaagaaac 120 tattgttgac aactttttgc gcaccgcaat tgtacgatga acagattttc aacagtatgt 180 gcccgagaac ggaggcataa aaaactttgg aatcggacta aaaggtgttt cttttaatag 240 gaaattttgt ttaatcgcga ttctatgtcg tatccaaaat ttgtcgtgtt ttgaaaactc 300 aacgcgcatc gcgcacgagt ggaactcgca agtttatttg gaaacatttg atgaaacatc 360 gctactttgt tgcgtctcca agatgtcctc caagcgcccc tgctccattg ttggaagctt 420 ttggatgctt tggatctgga cagctacctc tatggtcgcc agggcggtca tcttgcaccc 480 aaatgaaact atctttaacg acttttgtaa gaaatctaca acttgtgaag tgctaaaata 540 caatacgtgc ctaggctcgc ctttgccgta cacgcacacg tctctgattc tggctgaaga 600 ctcagaaact caagaggaag ccttcgagaa attggccatg tggtctgggt tgagaaatgc 660 tcctcgctgt tgggccgtca ttcagccgtt gctgtgtgcc gtttacatgc caaagtgtga 720 gaatggaaag gtggagctgc ccagccagca cctgtgccaa gctacacgca atccatgcag 780 tattgtggag cgggagaggg gctggcccaa cttcctaaag tgtgaaaaca aggagcagtt 840 ccctaagggt tgccagaatg aagtacagaa gctcaaattc aacacttcag gccaatgcga 900 ggctccgttg gtcaaaaccg acatccaggc aagttggtat aaggatgtgg aaggctgtgg 960 gatccagtgc gacaaccctc tgttcacaga ggacgagcac tcagacatgc atagctacat 1020 cgccgtcttc ggtaccatca ctctcctctg taccttctta cacctggcca catttcttgc 1080 agactggaag aattccaacc gttaccccgc tgtgatcctc ttttacgtca acgcctgttt 1140 cttcattgga agtattggat ggctcgctca attcatggat ggagcccgca atgagatagt 1200 atgcaaaagc gacaacacta tgcgacttgg agagccatca tccaccgaga cgctgtcatg 1260 cgtgattatc ttcgtcattg tgtattactc gcttatgtct ggagttattt ggttcgttat 1320 gctcacttat gcctggcaca catcattcaa agcccttggc accacccatc agcctctgtc 1380 tggaaaaacg tcctatttcc acttggtcac gtggtctatt ccttttatac tgaccgttgc 1440 tatactggcc aattcacagg tggatgcaga ttcagttagt ggcatttgct ttgtcggcta 1500 taggtattat gaatatcgtg ccgggtttgt tctggctcct attgggtttg tccttgttat 1560 tggtggctat ttcttgatac gaggggttat gacattgttt tccattaaaa gcaaccatcc 1620 agggctgctc agcgaaaaag ctgccagtaa aatcaatgaa accatgctaa ggcttggcat 1680 atttggattc ctggcttttg gttttgtcct gattacattc ggatgtcatt tttatgactt 1740 cttcaatcaa gctgagtggg aaagaagctt cagggaatat gttctatgtg aagctaatgt 1800 caccatagct caccagacca acaagccgat cccagaatgt gccatcaaga acaggcccag 1860 tctgctggtg gggaagatca acctcttctc aatgtttggc acaggcattg caatgagcac 1920 ctgggtttgg acaaaggcaa caatcctcat ttggaaacgg acctggttca gaatcattgg 1980 acgaagcgac gatgagccga aacgcattaa aaagagtaaa atgattgcca aggcattctc 2040 aaagcgaaag gaactccaga aagatcctga gaaagagctg tcctttagca tgcacacagt 2100 ttcccatgag ggacctgtgg cgggtatcaa ctttgacctg aatgaaccgt cgattgagat 2160 gtcttctgca tgggcccaac atgtgaccaa gatggtggca agaagagggg caattttacc 2220 gcaggatatc tcagtgacac ccactggaac acccattccg cctccagagg aaaggaacaa 2280 actttggatg gtcgaagcag aaatttctcc tgaaatgatg aaaagaaaga agaagaagaa 2340 aaagcgaagg aaggaagtac ggccagcggg tcctgcagca gatgaaggga atcctgccta 2400 ccaccggagg gagtttggcc ccagtgcagt gccgcgtctc ccaaaacttc ctggtcaccg 2460 gagtctggtg gctaaccttt gggaacagca gcgacagcag caggaggaac aggatatgct 2520 acctggagcc ttcccagaat tcaggccttc gtgtccactt ccttaccagg ataggtacgg 2580 tgggctggga tatttgcgca ataaaccctc cagtctccct ctggccaatc ccctgacgct 2640 aagagatagc atgcaaggag atttaagtca cttccagcaa agctcatggc aacctaaagg 2700 ggttttccgg caccttgggc aggaggcttc tatgatggat gttggcagga ctgcggttgt 2760 gccacgggca gatggcagga ggggtgtaca aatacattcc agaaccaatc ttatggatgc 2820 tgagctattg gatgctgact cagattttta agtctttgga gttactggtg ctcattctgt 2880 gcataaaaac aaaaggtgct ggatacttac tccaaactat caactatgag cagaagtggg 2940 tttgaactgc tgtgccggct ctgcttgtgt aaaggataat atttttttgc aaccctcatg 3000 atgttatgaa ttttgaactg gatttatgaa acataaaaaa taaaa 3045 46 822 PRT Danio rerio 46 Met Ser Ser Lys Arg Pro Cys Ser Ile Val Gly Ser Phe Trp Met Leu 1 5 10 15 Trp Ile Trp Thr Ala Thr Ser Met Val Ala Arg Ala Val Ile Leu His 20 25 30 Pro Asn Glu Thr Ile Phe Asn Asp Phe Cys Lys Lys Ser Thr Thr Cys 35 40 45 Glu Val Leu Lys Tyr Asn Thr Cys Leu Gly Ser Pro Leu Pro Tyr Thr 50 55 60 His Thr Ser Leu Ile Leu Ala Glu Asp Ser Glu Thr Gln Glu Glu Ala 65 70 75 80 Phe Glu Lys Leu Ala Met Trp Ser Gly Leu Arg Asn Ala Pro Arg Cys 85 90 95 Trp Ala Val Ile Gln Pro Leu Leu Cys Ala Val Tyr Met Pro Lys Cys 100 105 110 Glu Asn Gly Lys Val Glu Leu Pro Ser Gln His Leu Cys Gln Ala Thr 115 120 125 Arg Asn Pro Cys Ser Ile Val Glu Arg Glu Arg Gly Trp Pro Asn Phe 130 135 140 Leu Lys Cys Glu Asn Lys Glu Gln Phe Pro Lys Gly Cys Gln Asn Glu 145 150 155 160 Val Gln Lys Leu Lys Phe Asn Thr Ser Gly Gln Cys Glu Ala Pro Leu 165 170 175 Val Lys Thr Asp Ile Gln Ala Ser Trp Tyr Lys Asp Val Glu Gly Cys 180 185 190 Gly Ile Gln Cys Asp Asn Pro Leu Phe Thr Glu Asp Glu His Ser Asp 195 200 205 Met His Ser Tyr Ile Ala Val Phe Gly Thr Ile Thr Leu Leu Cys Thr 210 215 220 Phe Leu His Leu Ala Thr Phe Leu Ala Asp Trp Lys Asn Ser Asn Arg 225 230 235 240 Tyr Pro Ala Val Ile Leu Phe Tyr Val Asn Ala Cys Phe Phe Ile Gly 245 250 255 Ser Ile Gly Trp Leu Ala Gln Phe Met Asp Gly Ala Arg Asn Glu Ile 260 265 270 Val Cys Lys Ser Asp Asn Thr Met Arg Leu Gly Glu Pro Ser Ser Thr 275 280 285 Glu Thr Leu Ser Cys Val Ile Ile Phe Val Ile Val Tyr Tyr Ser Leu 290 295 300 Met Ser Gly Val Ile Trp Phe Val Met Leu Thr Tyr Ala Trp His Thr 305 310 315 320 Ser Phe Lys Ala Leu Gly Thr Thr His Gln Pro Leu Ser Gly Lys Thr 325 330 335 Ser Tyr Phe His Leu Val Thr Trp Ser Ile Pro Phe Ile Leu Thr Val 340 345 350 Ala Ile Leu Ala Asn Ser Gln Val Asp Ala Asp Ser Val Ser Gly Ile 355 360 365 Cys Phe Val Gly Tyr Arg Tyr Tyr Glu Tyr Arg Ala Gly Phe Val Leu 370 375 380 Ala Pro Ile Gly Phe Val Leu Val Ile Gly Gly Tyr Phe Leu Ile Arg 385 390 395 400 Gly Val Met Thr Leu Phe Ser Ile Lys Ser Asn His Pro Gly Leu Leu 405 410 415 Ser Glu Lys Ala Ala Ser Lys Ile Asn Glu Thr Met Leu Arg Leu Gly 420 425 430 Ile Phe Gly Phe Leu Ala Phe Gly Phe Val Leu Ile Thr Phe Gly Cys 435 440 445 His Phe Tyr Asp Phe Phe Asn Gln Ala Glu Trp Glu Arg Ser Phe Arg 450 455 460 Glu Tyr Val Leu Cys Glu Ala Asn Val Thr Ile Ala His Gln Thr Asn 465 470 475 480 Lys Pro Ile Pro Glu Cys Ala Ile Lys Asn Arg Pro Ser Leu Leu Val 485 490 495 Gly Lys Ile Asn Leu Phe Ser Met Phe Gly Thr Gly Ile Ala Met Ser 500 505 510 Thr Trp Val Trp Thr Lys Ala Thr Ile Leu Ile Trp Lys Arg Thr Trp 515 520 525 Phe Arg Ile Ile Gly Arg Ser Asp Asp Glu Pro Lys Arg Ile Lys Lys 530 535 540 Ser Lys Met Ile Ala Lys Ala Phe Ser Lys Arg Lys Glu Leu Gln Lys 545 550 555 560 Asp Pro Glu Lys Glu Leu Ser Phe Ser Met His Thr Val Ser His Glu 565 570 575 Gly Pro Val Ala Gly Ile Asn Phe Asp Leu Asn Glu Pro Ser Ile Glu 580 585 590 Met Ser Ser Ala Trp Ala Gln His Val Thr Lys Met Val Ala Arg Arg 595 600 605 Gly Ala Ile Leu Pro Gln Asp Ile Ser Val Thr Pro Thr Gly Thr Pro 610 615 620 Ile Pro Pro Pro Glu Glu Arg Asn Lys Leu Trp Met Val Glu Ala Glu 625 630 635 640 Ile Ser Pro Glu Met Met Lys Arg Lys Lys Lys Lys Lys Lys Arg Arg 645 650 655 Lys Glu Val Arg Pro Ala Gly Pro Ala Ala Asp Glu Gly Asn Pro Ala 660 665 670 Tyr His Arg Arg Glu Phe Gly Pro Ser Ala Val Pro Arg Leu Pro Lys 675 680 685 Leu Pro Gly His Arg Ser Leu Val Ala Asn Leu Trp Glu Gln Gln Arg 690 695 700 Gln Gln Gln Glu Glu Gln Asp Met Leu Pro Gly Ala Phe Pro Glu Phe 705 710 715 720 Arg Pro Ser Cys Pro Leu Pro Tyr Gln Asp Arg Tyr Gly Gly Leu Gly 725 730 735 Tyr Leu Arg Asn Lys Pro Ser Ser Leu Pro Leu Ala Asn Pro Leu Thr 740 745 750 Leu Arg Asp Ser Met Gln Gly Asp Leu Ser His Phe Gln Gln Ser Ser 755 760 765 Trp Gln Pro Lys Gly Val Phe Arg His Leu Gly Gln Glu Ala Ser Met 770 775 780 Met Asp Val Gly Arg Thr Ala Val Val Pro Arg Ala Asp Gly Arg Arg 785 790 795 800 Gly Val Gln Ile His Ser Arg Thr Asn Leu Met Asp Ala Glu Leu Leu 805 810 815 Asp Ala Asp Ser Asp Phe 820 47 2445 DNA Danio rerio 47 aggagctggt tacacacgtg tgtgtagatg tagaacagaa acatgggcaa aagaaggcga 60 tgtaaacagg agattgataa tttaactaag aaacagaaga aacatctaaa ggaatttggt 120 gaacaacacc cttttcacga taaggttgtt gaaagaccgg agaagactca gattttgcgc 180 ttgcctgaca gtccacagcg ccctgaacca gacagtgagg atgacagtga tgctgagcaa 240 ccatctgcat atcagaaact actgtccacc atgattcgag gtgatgaaga tgatgttgag 300 agtgaagatg aagaaagcga ggaagaggat aacgaggagg aagctgaagt tgaaggtgat 360 agtgaagaag aagtggagga caccgatgta gaggaaggga atgatgctga ggaagtttta 420 gaggataaag tggaagaaac atctgacaaa ttacagaaaa aacaccctga aaagaccaat 480 ggagatgtag aggagaatga agaggcagag atggagggag agtttacaga taaaaagaat 540 gaagctgctt tctgcttgga gaccaatatg cctgcagaag gagaggaaaa cacagcacaa 600 caggaacagg atgaagacat gtttgtgaag caccaagaga ttgaactgag cgaagaggaa 660 gtgggacgaa tctcacaagg atcaaaggtc aagactcagg ttaagtgggc aaaactgggt 720 gttctgcagt gcatgagtcc attggaacgt tttccagcca tcggacaggc ctcttctgcc 780 cctcttccac ccattcacaa aaccctagag gcaaattggc atttgttaaa cctgcctttt 840 ggagctgcaa cagatgtcac agagctgcaa aaggagatgc ttgggctcat gggtacttac 900 agagacctct atttggcaaa cctctccccc ttgaaggagg ctaaggaagt gcgcaacgca 960 tactgtcttc atgccctgaa ccatgtatta aaggccaact ctcgtgtgct tagaaacaat 1020 gccaagttaa aggaatctaa aaatggggat gaggattttc gtgaccaagg gcttactaga 1080 ccaaaggtct tgattttggt gccattcaga gatggagcac tgagagttgt ccagaccttc 1140 atgacacttc tggagcccaa aggcaagaag acggatgcca gtaacaagaa acgcttcaag 1200 gaggaatatg gagaggagac ggatgagaag ccacctaatc tgcaaaggcc tgatgactac 1260 cacgctgtct tttctggaaa tatagatgat catttcagga taggtatgtc tatcctgaga 1320 cacagcatgc gcctatacgc tcccttctat tcctctgaca tcatcattgc ttctccgctg 1380 agtctccgca cagttctggg tgctgaaggg gagaagaaaa gagaccatga ctttctgtct 1440 tcaatcgaac ttttgattgt ggatcaggca gatgttttcc tcatgcagaa ctgggaacat 1500 ttactccatg tgttggagca tttgaacctg cagccgctgg actctcatgg tgttgatttc 1560 tctcgcgttc gcatgtggaa cctaaacaac tgggcagctt attacagaca gacgctggtg 1620 ttcagtgcca tacaagaacc tcagatcacc aacatcctca ccaaacactg ccacaactac 1680 agaggccagg tgtgcagtaa aaccattcca aagataggct caatctgtca ggtacttgtg 1740 cagctgccac atgtttttca gatgtttcac tctgacagct tcatggacca ggatgccagg 1800 tcccagttct ttgtggataa gatcctgcct cagtacagag actctgtcat gtccgacact 1860 ttcatctatg tgcgttcgta ctttgatctt gtacgtctgc gaaactacat gaagaaggaa 1920 gacgtcagtt ttgttaacat tagcgagtac tctcagagat cagaggtgtc tcgagcgcga 1980 cattacttcc agaaaggaga aaaacagttt ctgctcttct ctgagagatt ccacttttat 2040 aagagacaca ctatcaaggg tatccacaat ctgatcttct acgggttgcc cacataccct 2100 catttctaca gtgaggtgtg taatatgctt caggctggag tcagagaggg tggtgcttct 2160 gtcagcttca cctgcacggc tctgtactct cgatatgatg tgcaccgtct ggctgccatc 2220 accggggctg accgcgccgc tcagatgctc cagtccaaaa aaaccacaca cctcttcatc 2280 acgggggagg agaaaaacac atgaagaggt actgcgaaaa tctgatgtga cgttttaatc 2340 tctctgagaa ggaagataat ctgtgttggt ggtagtgggt aaaatgatgt attcccttct 2400 gtcatatgac tgaatttaat gatgatattc cctttccctt ttata 2445 48 753 PRT Danio rerio 48 Met Gly Lys Arg Arg Arg Cys Lys Gln Glu Ile Asp Asn Leu Thr Lys 1 5 10 15 Lys Gln Lys Lys His Leu Lys Glu Phe Gly Glu Gln His Pro Phe His 20 25 30 Asp Lys Val Val Glu Arg Pro Glu Lys Thr Gln Ile Leu Arg Leu Pro 35 40 45 Asp Ser Pro Gln Arg Pro Glu Pro Asp Ser Glu Asp Asp Ser Asp Ala 50 55 60 Glu Gln Pro Ser Ala Tyr Gln Lys Leu Leu Ser Thr Met Ile Arg Gly 65 70 75 80 Asp Glu Asp Asp Val Glu Ser Glu Asp Glu Glu Ser Glu Glu Glu Asp 85 90 95 Asn Glu Glu Glu Ala Glu Val Glu Gly Asp Ser Glu Glu Glu Val Glu 100 105 110 Asp Thr Asp Val Glu Glu Gly Asn Asp Ala Glu Glu Val Leu Glu Asp 115 120 125 Lys Val Glu Glu Thr Ser Asp Lys Leu Gln Lys Lys His Pro Glu Lys 130 135 140 Thr Asn Gly Asp Val Glu Glu Asn Glu Glu Ala Glu Met Glu Gly Glu 145 150 155 160 Phe Thr Asp Lys Lys Asn Glu Ala Ala Phe Cys Leu Glu Thr Asn Met 165 170 175 Pro Ala Glu Gly Glu Glu Asn Thr Ala Gln Gln Glu Gln Asp Glu Asp 180 185 190 Met Phe Val Lys His Gln Glu Ile Glu Leu Ser Glu Glu Glu Val Gly 195 200 205 Arg Ile Ser Gln Gly Ser Lys Val Lys Thr Gln Val Lys Trp Ala Lys 210 215 220 Leu Gly Val Leu Gln Cys Met Ser Pro Leu Glu Arg Phe Pro Ala Ile 225 230 235 240 Gly Gln Ala Ser Ser Ala Pro Leu Pro Pro Ile His Lys Thr Leu Glu 245 250 255 Ala Asn Trp His Leu Leu Asn Leu Pro Phe Gly Ala Ala Thr Asp Val 260 265 270 Thr Glu Leu Gln Lys Glu Met Leu Gly Leu Met Gly Thr Tyr Arg Asp 275 280 285 Leu Tyr Leu Ala Asn Leu Ser Pro Leu Lys Glu Ala Lys Glu Val Arg 290 295 300 Asn Ala Tyr Cys Leu His Ala Leu Asn His Val Leu Lys Ala Asn Ser 305 310 315 320 Arg Val Leu Arg Asn Asn Ala Lys Leu Lys Glu Ser Lys Asn Gly Asp 325 330 335 Glu Asp Phe Arg Asp Gln Gly Leu Thr Arg Pro Lys Val Leu Ile Leu 340 345 350 Val Pro Phe Arg Asp Gly Ala Leu Arg Val Val Gln Thr Phe Met Thr 355 360 365 Leu Leu Glu Pro Lys Gly Lys Lys Thr Asp Ala Ser Asn Lys Lys Arg 370 375 380 Phe Lys Glu Glu Tyr Gly Glu Glu Thr Asp Glu Lys Pro Pro Asn Leu 385 390 395 400 Gln Arg Pro Asp Asp Tyr His Ala Val Phe Ser Gly Asn Ile Asp Asp 405 410 415 His Phe Arg Ile Gly Met Ser Ile Leu Arg His Ser Met Arg Leu Tyr 420 425 430 Ala Pro Phe Tyr Ser Ser Asp Ile Ile Ile Ala Ser Pro Leu Ser Leu 435 440 445 Arg Thr Val Leu Gly Ala Glu Gly Glu Lys Lys Arg Asp His Asp Phe 450 455 460 Leu Ser Ser Ile Glu Leu Leu Ile Val Asp Gln Ala Asp Val Phe Leu 465 470 475 480 Met Gln Asn Trp Glu His Leu Leu His Val Leu Glu His Leu Asn Leu 485 490 495 Gln Pro Leu Asp Ser His Gly Val Asp Phe Ser Arg Val Arg Met Trp 500 505 510 Asn Leu Asn Asn Trp Ala Ala Tyr Tyr Arg Gln Thr Leu Val Phe Ser 515 520 525 Ala Ile Gln Glu Pro Gln Ile Thr Asn Ile Leu Thr Lys His Cys His 530 535 540 Asn Tyr Arg Gly Gln Val Cys Ser Lys Thr Ile Pro Lys Ile Gly Ser 545 550 555 560 Ile Cys Gln Val Leu Val Gln Leu Pro His Val Phe Gln Met Phe His 565 570 575 Ser Asp Ser Phe Met Asp Gln Asp Ala Arg Ser Gln Phe Phe Val Asp 580 585 590 Lys Ile Leu Pro Gln Tyr Arg Asp Ser Val Met Ser Asp Thr Phe Ile 595 600 605 Tyr Val Arg Ser Tyr Phe Asp Leu Val Arg Leu Arg Asn Tyr Met Lys 610 615 620 Lys Glu Asp Val Ser Phe Val Asn Ile Ser Glu Tyr Ser Gln Arg Ser 625 630 635 640 Glu Val Ser Arg Ala Arg His Tyr Phe Gln Lys Gly Glu Lys Gln Phe 645 650 655 Leu Leu Phe Ser Glu Arg Phe His Phe Tyr Lys Arg His Thr Ile Lys 660 665 670 Gly Ile His Asn Leu Ile Phe Tyr Gly Leu Pro Thr Tyr Pro His Phe 675 680 685 Tyr Ser Glu Val Cys Asn Met Leu Gln Ala Gly Val Arg Glu Gly Gly 690 695 700 Ala Ser Val Ser Phe Thr Cys Thr Ala Leu Tyr Ser Arg Tyr Asp Val 705 710 715 720 His Arg Leu Ala Ala Ile Thr Gly Ala Asp Arg Ala Ala Gln Met Leu 725 730 735 Gln Ser Lys Lys Thr Thr His Leu Phe Ile Thr Gly Glu Glu Lys Asn 740 745 750 Thr 49 701 DNA Danio rerio 49 aggcagtggt aacaacgcag agtacgcggg attcatatca gcaatggcgt cccagcagat 60 gagcactaac aatgctcaga tgagcgctca gcaagcggca gttttacagc aaacacaagc 120 tcaacagctc agtcaacagc aagatttcga ccccgttcac agattcaaga tgctaattcc 180 accgctcaaa gacagtctac agaatgtcat gaccatcgcg tctctaaatt ttgcacataa 240 tactgctatt gacaacggct taaaaacaac cgagaagggt aatgatgccg cagttcagcg 300 gtttgacaaa agtctagagg agttttatgc tctctgtgat cagctggagc tctgcctgcg 360 gctggcccat gagtgcctct cacagagcat cgacagcacg aaacactctc caaatctcgt 420 acccacagcc actaaacctg acacggtgca aacagaatct ctgtcttact ctcagtatct 480 cagcatgatc aaatcacaga tttcatgtgc taaagacatc cataacgctc tcttagagtg 540 ttcaaaaaag atcgctggga aaggtcaggg agcttgtaat tagttgctgc caaatctatt 600 tcgtttcccc tttgtgtttg tgtatgcata actttgatga tgatcaaaaa agtttttttt 660 tttttttggg gggccccctg gggggttaat ttaaaaaaaa a 701 50 179 PRT Danio rerio 50 Met Ala Ser Gln Gln Met Ser Thr Asn Asn Ala Gln Met Ser Ala Gln 1 5 10 15 Gln Ala Ala Val Leu Gln Gln Thr Gln Ala Gln Gln Leu Ser Gln Gln 20 25 30 Gln Asp Phe Asp Pro Val His Arg Phe Lys Met Leu Ile Pro Pro Leu 35 40 45 Lys Asp Ser Leu Gln Asn Val Met Thr Ile Ala Ser Leu Asn Phe Ala 50 55 60 His Asn Thr Ala Ile Asp Asn Gly Leu Lys Thr Thr Glu Lys Gly Asn 65 70 75 80 Asp Ala Ala Val Gln Arg Phe Asp Lys Ser Leu Glu Glu Phe Tyr Ala 85 90 95 Leu Cys Asp Gln Leu Glu Leu Cys Leu Arg Leu Ala His Glu Cys Leu 100 105 110 Ser Gln Ser Ile Asp Ser Thr Lys His Ser Pro Asn Leu Val Pro Thr 115 120 125 Ala Thr Lys Pro Asp Thr Val Gln Thr Glu Ser Leu Ser Tyr Ser Gln 130 135 140 Tyr Leu Ser Met Ile Lys Ser Gln Ile Ser Cys Ala Lys Asp Ile His 145 150 155 160 Asn Ala Leu Leu Glu Cys Ser Lys Lys Ile Ala Gly Lys Gly Gln Gly 165 170 175 Ala Cys Asn 51 2145 DNA Danio rerio 51 ggcacgagct gagcttgtaa atcagaaaca aaactgtaag tcggtcggct gcacggttcg 60 attcaagtct ctccactgca gtacaaacat gttcacgcaa gaggttaggt aaagacagcc 120 cgaaacatct cgacatccca aaacgaaccc gacagcgtcc acagaagaac tttgaggacg 180 cctccgactt cgctttgcat tttccagcat gatgtcacaa gcagatgcag acatcacgcc 240 cttcttcgct gatgataatg aaggggaggg gcctgttgaa aatggggtgg tgtcacctct 300 gcctgaggac gaggaggagg agtctcccag cggagtcacc gataggagag ccataatgac 360 tgtgattgtg ctctgctaca tcaacctgct gaactatatg gacagattca ctgtggcagg 420 tgtacttcca gacattgagc atttcttcgg cattggtgat ggaacgtcag ggctgcttca 480 aacagtcttc atctgcagct acatgttctt ggctcctctg ttcggatacc tgggtgaccg 540 gtataacagg aagctgatca tgtgtgtggg gattttcttc tggtctgtgg taacacttgc 600 cagctcattt atcgggaaag atcacttctg ggcactgctg ttgacacgag ggctagtggg 660 tgttggagag gccagttatt ccaccattgc tcccaccatc atcgctgacc tctttgtcaa 720 ggaaaagagg actaacatgc tgtctatatt ttattttgcc atcccagtgg gcagtggtat 780 gggttatatc gtgggctcaa aggtggatac tgtggctaaa gattggcatt gggcacttcg 840 ggtgacccct gggctgggac tgctggccgt gttcctctta atgcttgtgg ttcaggaacc 900 gaagcgtggt gcgattgaag ctcaccccga gcacacgctg cacagaacca gctggctggc 960 agacatgaaa gcactttgca ggaatccagg ttttattctg tccacgttcg gcttcacggc 1020 agtggccttt gtgacgggat cactggccct gtgggctcct gcctttttat ttagagccgg 1080 cgtcttcacc ggagtcaagc agccgtgttt taaagctcca tgtgatgact ctgacagtct 1140 gatttttggc gccatcactg tagtaacggg catcttgggt gtagcgagcg gtgttcaggc 1200 cagtaaactg cagaggacca gaactcctcg agctgacccg ctagtgtgtg cagctggact 1260 cctcctcgcg gctccctttc tctatctctc catcatgttt gcacaggcga gcactgtggc 1320 cacatatgtc tttatcttcc tgggagagac attcctttca atgaattggg ccattgttgc 1380 tgatatccta ctgtatgttg tgatcccgac acgtcgctcc acagcagagg cctttcagat 1440 tgtcctttca cacttgctgg gtgacgctat tagtccctat ctcataggag tggtttcaga 1500 ctcaataaaa gaatcaaatt cgtacatgtg ggagtttcgc tcacttcaga tgtctctgct 1560 gctgtgctct ttcgtgcccg tcgctggagg cgctttcttc ttagccacag ccgtctttat 1620 tgagaaagac cgagatctcg ctgagaacta tgtgccctct gatgacgcac caattgttgt 1680 tccaaggagc ggcagatcca ctaaagtatc agtgtcgagt gttttaattt gagatgacgc 1740 aaaagagtgg atggctcttg acatgcaggg aacagcactc tgattggcta aggcagacag 1800 aggcagtgtc atcagcagcc aatcctggag cagtatcaaa caagtaatga ctggactatc 1860 tcctcaatag gaaattttat agtaatttat taagcttttt ttgttttgtt tatatcctta 1920 acaccttgat ttgggaacat ttctaactaa caagaaaatc agaaacctct tacctgaaaa 1980 ccctagtgtg ccgctggacc actttgttgt ttcacctgga ggagacagaa gcacaagggt 2040 aaaacaagac aaccggggac atttatttgc ctttaattaa aactgtgttt tagacttcaa 2100 gtcgcatttt acgttttttt ttatcagaag ggatgtgaac taaat 2145 52 507 PRT Danio rerio 52 Met Met Ser Gln Ala Asp Ala Asp Ile Thr Pro Phe Phe Ala Asp Asp 1 5 10 15 Asn Glu Gly Glu Gly Pro Val Glu Asn Gly Val Val Ser Pro Leu Pro 20 25 30 Glu Asp Glu Glu Glu Glu Ser Pro Ser Gly Val Thr Asp Arg Arg Ala 35 40 45 Ile Met Thr Val Ile Val Leu Cys Tyr Ile Asn Leu Leu Asn Tyr Met 50 55 60 Asp Arg Phe Thr Val Ala Gly Val Leu Pro Asp Ile Glu His Phe Phe 65 70 75 80 Gly Ile Gly Asp Gly Thr Ser Gly Leu Leu Gln Thr Val Phe Ile Cys 85 90 95 Ser Tyr Met Phe Leu Ala Pro Leu Phe Gly Tyr Leu Gly Asp Arg Tyr 100 105 110 Asn Arg Lys Leu Ile Met Cys Val Gly Ile Phe Phe Trp Ser Val Val 115 120 125 Thr Leu Ala Ser Ser Phe Ile Gly Lys Asp His Phe Trp Ala Leu Leu 130 135 140 Leu Thr Arg Gly Leu Val Gly Val Gly Glu Ala Ser Tyr Ser Thr Ile 145 150 155 160 Ala Pro Thr Ile Ile Ala Asp Leu Phe Val Lys Glu Lys Arg Thr Asn 165 170 175 Met Leu Ser Ile Phe Tyr Phe Ala Ile Pro Val Gly Ser Gly Met Gly 180 185 190 Tyr Ile Val Gly Ser Lys Val Asp Thr Val Ala Lys Asp Trp His Trp 195 200 205 Ala Leu Arg Val Thr Pro Gly Leu Gly Leu Leu Ala Val Phe Leu Leu 210 215 220 Met Leu Val Val Gln Glu Pro Lys Arg Gly Ala Ile Glu Ala His Pro 225 230 235 240 Glu His Thr Leu His Arg Thr Ser Trp Leu Ala Asp Met Lys Ala Leu 245 250 255 Cys Arg Asn Pro Gly Phe Ile Leu Ser Thr Phe Gly Phe Thr Ala Val 260 265 270 Ala Phe Val Thr Gly Ser Leu Ala Leu Trp Ala Pro Ala Phe Leu Phe 275 280 285 Arg Ala Gly Val Phe Thr Gly Val Lys Gln Pro Cys Phe Lys Ala Pro 290 295 300 Cys Asp Asp Ser Asp Ser Leu Ile Phe Gly Ala Ile Thr Val Val Thr 305 310 315 320 Gly Ile Leu Gly Val Ala Ser Gly Val Gln Ala Ser Lys Leu Gln Arg 325 330 335 Thr Arg Thr Pro Arg Ala Asp Pro Leu Val Cys Ala Ala Gly Leu Leu 340 345 350 Leu Ala Ala Pro Phe Leu Tyr Leu Ser Ile Met Phe Ala Gln Ala Ser 355 360 365 Thr Val Ala Thr Tyr Val Phe Ile Phe Leu Gly Glu Thr Phe Leu Ser 370 375 380 Met Asn Trp Ala Ile Val Ala Asp Ile Leu Leu Tyr Val Val Ile Pro 385 390 395 400 Thr Arg Arg Ser Thr Ala Glu Ala Phe Gln Ile Val Leu Ser His Leu 405 410 415 Leu Gly Asp Ala Ile Ser Pro Tyr Leu Ile Gly Val Val Ser Asp Ser 420 425 430 Ile Lys Glu Ser Asn Ser Tyr Met Trp Glu Phe Arg Ser Leu Gln Met 435 440 445 Ser Leu Leu Leu Cys Ser Phe Val Pro Val Ala Gly Gly Ala Phe Phe 450 455 460 Leu Ala Thr Ala Val Phe Ile Glu Lys Asp Arg Asp Leu Ala Glu Asn 465 470 475 480 Tyr Val Pro Ser Asp Asp Ala Pro Ile Val Val Pro Arg Ser Gly Arg 485 490 495 Ser Thr Lys Val Ser Val Ser Ser Val Leu Ile 500 505 53 2213 DNA Danio rerio 53 tttttttaaa aaagcagaaa ctaagagaat aaagtcgtga cttatttgct gtttgttact 60 tttgctcgct ttttattcgg tatacactgc gggattgttt cgctcgttgt tcttttcgga 120 ggcccggttt tttattactc aagaaaatcc agctcgcgag acccgagcgt ctgaaagaag 180 ttaaacacag aaagactccg catccccatg aggcaaccag aagtgccaat cgaagagatt 240 tcgtgactga accttttgtt ttgtattcat ttttggtgct tcattacttt agcagtcagc 300 gatgaagatg atcgttgtgt ttacagtatg tatgtctgtg gtggttttgg cgtcggctca 360 ggctgatcag aagtcaaaga actgcaatga agtcagaact gcctacagtt ccaaaggttt 420 caacgtcaac gatgttccca acaaaggagt gcagggagcc catctgaagg tgtgtcctca 480 aggtttctcc tgctgtactc tggagatgga ggagaagttg agtcagctga gccgcgtaga 540 cctgaaagtt cccgttcacc agctcagctc caacctgcag agtaccttca cccagagaca 600 ccgtcacttt gaccagttct tcagagagct gttggacaat gcagagaagt ctctcaatca 660 catgtttgtg cggacgtatg gcttgatgta cgtcaagaac aaagagctct ttgagggttt 720 cttcagtgat ctgcgccgct actacagcca tggcagcagt gaggtcaatc tggacgacat 780 gctggccgag ttctggtctg aactgctgga gcgcatgttt cgactggtca acgtccagta 840 cgagttcagc gactcctaca tggagtgtgt cagcaggcac acggaccagc tcaaaccctt 900 cggagacgtt ccacgcaagc tccgtctgca gttgacccgc tccttcattg ctgttcgcgc 960 cttcactcgt ggcctcacac tcatgcctga tgtggtccgc aaagtctcaa cggtgagcgc 1020 ctcaccgagc tgtgtgcgcg catcgatgaa gatgttatac tgcccgtact gtagcggtca 1080 ggtggcgctg aagccctgca agaactactg tctgaatgtc atgaggggct gtctggccaa 1140 tcaggccgat ctggacacag agtggaataa cttccttgac tccatgttag gtctcgctga 1200 gaggctggag ggacccttta acttcgagtc ggtcgtggac cccatcgatg tgaagatttc 1260 agatgctatc atgaacatgc aggagaacag catgcaggtc tcacagaagg tgtttcaggg 1320 ttgtgggcag cccaaactca gcatgggctt ccgcagccgg aggtctgcta aagactcggg 1380 cttccctggc cgttttagac cttacagccc tgaggccaga cccaccactg ctgctggcac 1440 caccctggac cgactgctga cggatgtcaa gaagaaactg aagcatgcca agaagttctg 1500 gtccaccctg ccggacacag tgtgtgttgg agaaaggatc gcccccagtg atgattgctg 1560 gaatggtaca gctaaaagca ggtacgaatc tgtggttatg agcagcggtt tagccaatca 1620 ggtgtctaat ccagacgtgg aggttgacat caccaaacca gacatggtga tccggcggca 1680 gatcgcagtc ctgaaggaaa tgaccagctg gctgaaagct gcttacaccg gcaacgacat 1740 ctcatttacc tccgatgatg ccggcagcgg agaggaaagc ggaagcggct gcgactctcc 1800 ctcttgcgaa ggcgacggag atatttactt ctccaccccg gcacctggta aacctcgcat 1860 cctcaagact ccggaggtcg aacaggcctc cagcggcaca cggttggcac cctgcagcct 1920 ggcactggcc ctggcctccc ttctgctcac cctgcttact ctccaaacaa gataaaagca 1980 gaatcagccc aagaaagagc aagtgattgg acaaatgttt ttgaccgcga gagcagaaaa 2040 ctgccacagg acattaaaga cactgctttc aggatctcca tctgccttcc atccggggaa 2100 gacactcaag ttattaatat tcacacaact tcagtgtatt tttttaatgt ttataagagc 2160 ggactccatt ttgtgtacag ctatacggct ccatgcagca ttgcgtcgtc tgg 2213 54 557 PRT Danio rerio 54 Met Lys Met Ile Val Val Phe Thr Val Cys Met Ser Val Val Val Leu 1 5 10 15 Ala Ser Ala Gln Ala Asp Gln Lys Ser Lys Asn Cys Asn Glu Val Arg 20 25 30 Thr Ala Tyr Ser Ser Lys Gly Phe Asn Val Asn Asp Val Pro Asn Lys 35 40 45 Gly Val Gln Gly Ala His Leu Lys Val Cys Pro Gln Gly Phe Ser Cys 50 55 60 Cys Thr Leu Glu Met Glu Glu Lys Leu Ser Gln Leu Ser Arg Val Asp 65 70 75 80 Leu Lys Val Pro Val His Gln Leu Ser Ser Asn Leu Gln Ser Thr Phe 85 90 95 Thr Gln Arg His Arg His Phe Asp Gln Phe Phe Arg Glu Leu Leu Asp 100 105 110 Asn Ala Glu Lys Ser Leu Asn His Met Phe Val Arg Thr Tyr Gly Leu 115 120 125 Met Tyr Val Lys Asn Lys Glu Leu Phe Glu Gly Phe Phe Ser Asp Leu 130 135 140 Arg Arg Tyr Tyr Ser His Gly Ser Ser Glu Val Asn Leu Asp Asp Met 145 150 155 160 Leu Ala Glu Phe Trp Ser Glu Leu Leu Glu Arg Met Phe Arg Leu Val 165 170 175 Asn Val Gln Tyr Glu Phe Ser Asp Ser Tyr Met Glu Cys Val Ser Arg 180 185 190 His Thr Asp Gln Leu Lys Pro Phe Gly Asp Val Pro Arg Lys Leu Arg 195 200 205 Leu Gln Leu Thr Arg Ser Phe Ile Ala Val Arg Ala Phe Thr Arg Gly 210 215 220 Leu Thr Leu Met Pro Asp Val Val Arg Lys Val Ser Thr Val Ser Ala 225 230 235 240 Ser Pro Ser Cys Val Arg Ala Ser Met Lys Met Leu Tyr Cys Pro Tyr 245 250 255 Cys Ser Gly Gln Val Ala Leu Lys Pro Cys Lys Asn Tyr Cys Leu Asn 260 265 270 Val Met Arg Gly Cys Leu Ala Asn Gln Ala Asp Leu Asp Thr Glu Trp 275 280 285 Asn Asn Phe Leu Asp Ser Met Leu Gly Leu Ala Glu Arg Leu Glu Gly 290 295 300 Pro Phe Asn Phe Glu Ser Val Val Asp Pro Ile Asp Val Lys Ile Ser 305 310 315 320 Asp Ala Ile Met Asn Met Gln Glu Asn Ser Met Gln Val Ser Gln Lys 325 330 335 Val Phe Gln Gly Cys Gly Gln Pro Lys Leu Ser Met Gly Phe Arg Ser 340 345 350 Arg Arg Ser Ala Lys Asp Ser Gly Phe Pro Gly Arg Phe Arg Pro Tyr 355 360 365 Ser Pro Glu Ala Arg Pro Thr Thr Ala Ala Gly Thr Thr Leu Asp Arg 370 375 380 Leu Leu Thr Asp Val Lys Lys Lys Leu Lys His Ala Lys Lys Phe Trp 385 390 395 400 Ser Thr Leu Pro Asp Thr Val Cys Val Gly Glu Arg Ile Ala Pro Ser 405 410 415 Asp Asp Cys Trp Asn Gly Thr Ala Lys Ser Arg Tyr Glu Ser Val Val 420 425 430 Met Ser Ser Gly Leu Ala Asn Gln Val Ser Asn Pro Asp Val Glu Val 435 440 445 Asp Ile Thr Lys Pro Asp Met Val Ile Arg Arg Gln Ile Ala Val Leu 450 455 460 Lys Glu Met Thr Ser Trp Leu Lys Ala Ala Tyr Thr Gly Asn Asp Ile 465 470 475 480 Ser Phe Thr Ser Asp Asp Ala Gly Ser Gly Glu Glu Ser Gly Ser Gly 485 490 495 Cys Asp Ser Pro Ser Cys Glu Gly Asp Gly Asp Ile Tyr Phe Ser Thr 500 505 510 Pro Ala Pro Gly Lys Pro Arg Ile Leu Lys Thr Pro Glu Val Glu Gln 515 520 525 Ala Ser Ser Gly Thr Arg Leu Ala Pro Cys Ser Leu Ala Leu Ala Leu 530 535 540 Ala Ser Leu Leu Leu Thr Leu Leu Thr Leu Gln Thr Arg 545 550 555 55 2688 DNA Danio rerio 55 ctgacggagc agctgagaag ccggccagca gtgcagccct ttgtccgcta aataaacctg 60 atcattttta ttaaaaagtg tacagttttt cttctcgctg gcctttctaa ctttaccaga 120 tacctcaatc atgcacgtga tcaagagaga tgggcgccag gagcgtgtaa tgtttgacaa 180 aatcacatca cgcatccaga aactttgcta tggactcaac tctgactttg ttgatccaac 240 tcagatcacc atgaaggtga ttcagggcct ttacagcggc gttactacag tagagctgga 300 cacactggct gctgagatcg ctgccaccct caccaccaaa caccctgact atgccatcct 360 agcagcccgc atcgccgtgt ctaacctgca taaagagact aaaaaggttt tcagtgaggt 420 tatggaggat ctctacaact acgtcaatcc cctgaacagc cgccactcgc ctatgatctc 480 caaggagacg cttgatattg ttttggccaa caaagatcgc ctgaactcag ccatcattta 540 tgacagggat ttctcttaca acttctttgg cttcaaaaca ttggagcgct catatctgct 600 gaagatcaat ggaaaggttg ccgagcggcc gcagcacatg ctgatgaggg tgtccgtagg 660 cattcataag gaagacattg cggctgctat cgaaacctac aacctgctgt ctgaaaagtg 720 gttcactcat gcttctccta ctctgtttaa tgcgggcacc aaccggcctc agctctccag 780 ttgcttcctg cttgctatga aagatgacag cattgagggc atctatgaca ccctgaagca 840 gtgtgcgctc atctccaagt ctgccggtgg catcggggtg gcagtgagct gcattagagc 900 cactggcagc tacattgcag gaacaaatgg taactcaaat ggcctggttc ctatgctccg 960 tgtctacaac aacaccgcac gctatgtgga ccaaggggga aataagagac caggagcttt 1020 tgccatgtac ctggagcctt ggcattttga cattttcgac ttcctagagc ttaaaaaaaa 1080 cactggtaaa gaggagcaga gagccagaga tctgttttac gccctctgga ttcctgatct 1140 cttcatgaag cgagtggaga ccaatgggga ttggtctctg atgtgcccaa atgactgtcc 1200 cggcctggat gagtgctggg gagaggagtt tgagaagctc tatgcaaaat atgagcagga 1260 gggcagagcg aagcgtgtgg ttaaggctca gcagctttgg tatgccatca tcgagtctca 1320 gactgagacc ggcacaccat acatgcttta caaagacgcc tgcaaccgca agagcaacca 1380 gcagaacctg ggcaccatca aatgcagcaa tttgtgcaca gagattgttg agtacaccag 1440 tgccgatgag gtggcagttt gcaacctggc ctccattgca cttaacatgt acgtcacctc 1500 agagcgaaca tttgacttcc agaagttggc ctctgttacg aaggtcattg tgaagaacct 1560 gaacaagatc attgatatca actactaccc agttaaagag gctgagaact ccaacaagcg 1620 ccaccggccc atcgggatcg gcgttcaggg tctggccgat gccttcatcc tcatgcgttt 1680 cccatttgag agtgcggagg ctcaactcct caacacgcag atctttgaga ccatctatta 1740 cgctgctctg gagtcaagct gtgagctcgc tgctgaatac ggcccgtatc aaacctacgc 1800 cggctgtcct gtcagcaagg gtatcttgca gtatgatatg tgggaaaaga cacctacaga 1860 cttgtgggac tgggcagcgc ttaaggagaa gattgcaaag catggggtgc gtaacagtct 1920 gctgctggcc ccaatgccca ctgcctccac agctcagatt ctgggcaaca acgaatccat 1980 tgagccgtac accagcaaca tctacacccg cagggtcctg tctggagagt ttcagatcgt 2040 gaacccgcat cttctcaaag acctcacaga acgaggactc tggaacgagg aaatgaaaaa 2100 ccagatcatc gctcagaacg gatccatcca gaccattccc gcaatccctg atgacctgaa 2160 ggagctgtat aagacagtgt gggagatttc tcagaagacc attttaaaga tggctgcaga 2220 tcgaggagcc tacattgacc agagccagtc gctcaacatc cacatcgctg agccaaacta 2280 cggcaaactc accagcatgc acttctacgg ctggaagcag ggtcttaaaa caggcatgta 2340 ctacctaaga actaaaccgg ctgcaaaccc catccagttc accctcaaca aagagaagct 2400 gaaggagaca cagaaaacga ccagcagtga agacgaggaa accaaagagc gtaataaagc 2460 tgcgatggtg tgttcactgg agaaccgtga cgagtgtctg atgtgcggat cataaactca 2520 actcctgcac tcggacacaa tcaatagcat catttattca tttgcaactt gtacagtatt 2580 aaaccttttt taaatggcca tggtgaagta cagaattatt catattaatt aactttccct 2640 tttgtatgcg aataatatag tgtacatttg ggaattatat ttttggat 2688 56 794 PRT Danio rerio 56 Met His Val Ile Lys Arg Asp Gly Arg Gln Glu Arg Val Met Phe Asp 1 5 10 15 Lys Ile Thr Ser Arg Ile Gln Lys Leu Cys Tyr Gly Leu Asn Ser Asp 20 25 30 Phe Val Asp Pro Thr Gln Ile Thr Met Lys Val Ile Gln Gly Leu Tyr 35 40 45 Ser Gly Val Thr Thr Val Glu Leu Asp Thr Leu Ala Ala Glu Ile Ala 50 55 60 Ala Thr Leu Thr Thr Lys His Pro Asp Tyr Ala Ile Leu Ala Ala Arg 65 70 75 80 Ile Ala Val Ser Asn Leu His Lys Glu Thr Lys Lys Val Phe Ser Glu 85 90 95 Val Met Glu Asp Leu Tyr Asn Tyr Val Asn Pro Leu Asn Ser Arg His 100 105 110 Ser Pro Met Ile Ser Lys Glu Thr Leu Asp Ile Val Leu Ala Asn Lys 115 120 125 Asp Arg Leu Asn Ser Ala Ile Ile Tyr Asp Arg Asp Phe Ser Tyr Asn 130 135 140 Phe Phe Gly Phe Lys Thr Leu Glu Arg Ser Tyr Leu Leu Lys Ile Asn 145 150 155 160 Gly Lys Val Ala Glu Arg Pro Gln His Met Leu Met Arg Val Ser Val 165 170 175 Gly Ile His Lys Glu Asp Ile Ala Ala Ala Ile Glu Thr Tyr Asn Leu 180 185 190 Leu Ser Glu Lys Trp Phe Thr His Ala Ser Pro Thr Leu Phe Asn Ala 195 200 205 Gly Thr Asn Arg Pro Gln Leu Ser Ser Cys Phe Leu Leu Ala Met Lys 210 215 220 Asp Asp Ser Ile Glu Gly Ile Tyr Asp Thr Leu Lys Gln Cys Ala Leu 225 230 235 240 Ile Ser Lys Ser Ala Gly Gly Ile Gly Val Ala Val Ser Cys Ile Arg 245 250 255 Ala Thr Gly Ser Tyr Ile Ala Gly Thr Asn Gly Asn Ser Asn Gly Leu 260 265 270 Val Pro Met Leu Arg Val Tyr Asn Asn Thr Ala Arg Tyr Val Asp Gln 275 280 285 Gly Gly Asn Lys Arg Pro Gly Ala Phe Ala Met Tyr Leu Glu Pro Trp 290 295 300 His Phe Asp Ile Phe Asp Phe Leu Glu Leu Lys Lys Asn Thr Gly Lys 305 310 315 320 Glu Glu Gln Arg Ala Arg Asp Leu Phe Tyr Ala Leu Trp Ile Pro Asp 325 330 335 Leu Phe Met Lys Arg Val Glu Thr Asn Gly Asp Trp Ser Leu Met Cys 340 345 350 Pro Asn Asp Cys Pro Gly Leu Asp Glu Cys Trp Gly Glu Glu Phe Glu 355 360 365 Lys Leu Tyr Ala Lys Tyr Glu Gln Glu Gly Arg Ala Lys Arg Val Val 370 375 380 Lys Ala Gln Gln Leu Trp Tyr Ala Ile Ile Glu Ser Gln Thr Glu Thr 385 390 395 400 Gly Thr Pro Tyr Met Leu Tyr Lys Asp Ala Cys Asn Arg Lys Ser Asn 405 410 415 Gln Gln Asn Leu Gly Thr Ile Lys Cys Ser Asn Leu Cys Thr Glu Ile 420 425 430 Val Glu Tyr Thr Ser Ala Asp Glu Val Ala Val Cys Asn Leu Ala Ser 435 440 445 Ile Ala Leu Asn Met Tyr Val Thr Ser Glu Arg Thr Phe Asp Phe Gln 450 455 460 Lys Leu Ala Ser Val Thr Lys Val Ile Val Lys Asn Leu Asn Lys Ile 465 470 475 480 Ile Asp Ile Asn Tyr Tyr Pro Val Lys Glu Ala Glu Asn Ser Asn Lys 485 490 495 Arg His Arg Pro Ile Gly Ile Gly Val Gln Gly Leu Ala Asp Ala Phe 500 505 510 Ile Leu Met Arg Phe Pro Phe Glu Ser Ala Glu Ala Gln Leu Leu Asn 515 520 525 Thr Gln Ile Phe Glu Thr Ile Tyr Tyr Ala Ala Leu Glu Ser Ser Cys 530 535 540 Glu Leu Ala Ala Glu Tyr Gly Pro Tyr Gln Thr Tyr Ala Gly Cys Pro 545 550 555 560 Val Ser Lys Gly Ile Leu Gln Tyr Asp Met Trp Glu Lys Thr Pro Thr 565 570 575 Asp Leu Trp Asp Trp Ala Ala Leu Lys Glu Lys Ile Ala Lys His Gly 580 585 590 Val Arg Asn Ser Leu Leu Leu Ala Pro Met Pro Thr Ala Ser Thr Ala 595 600 605 Gln Ile Leu Gly Asn Asn Glu Ser Ile Glu Pro Tyr Thr Ser Asn Ile 610 615 620 Tyr Thr Arg Arg Val Leu Ser Gly Glu Phe Gln Ile Val Asn Pro His 625 630 635 640 Leu Leu Lys Asp Leu Thr Glu Arg Gly Leu Trp Asn Glu Glu Met Lys 645 650 655 Asn Gln Ile Ile Ala Gln Asn Gly Ser Ile Gln Thr Ile Pro Ala Ile 660 665 670 Pro Asp Asp Leu Lys Glu Leu Tyr Lys Thr Val Trp Glu Ile Ser Gln 675 680 685 Lys Thr Ile Leu Lys Met Ala Ala Asp Arg Gly Ala Tyr Ile Asp Gln 690 695 700 Ser Gln Ser Leu Asn Ile His Ile Ala Glu Pro Asn Tyr Gly Lys Leu 705 710 715 720 Thr Ser Met His Phe Tyr Gly Trp Lys Gln Gly Leu Lys Thr Gly Met 725 730 735 Tyr Tyr Leu Arg Thr Lys Pro Ala Ala Asn Pro Ile Gln Phe Thr Leu 740 745 750 Asn Lys Glu Lys Leu Lys Glu Thr Gln Lys Thr Thr Ser Ser Glu Asp 755 760 765 Glu Glu Thr Lys Glu Arg Asn Lys Ala Ala Met Val Cys Ser Leu Glu 770 775 780 Asn Arg Asp Glu Cys Leu Met Cys Gly Ser 785 790 57 3662 DNA Danio rerio 57 aactattttg ccatttcttg tggtctactc ttaagcaaga tcggcttaac ggttacattt 60 aaagggcatt tgagatctgg agcctagaag acgctccttc aacatggcat catcacaagt 120 accagcagcc aaaaaagatg agaagggcag aaacatacaa gtggttgtac gatgcagacc 180 ctttaacaca gtggagcgta aatctggctc tcacactgtt gttgaatgtg accagaaccg 240 gaaagaggtg attatgcgta ctggaggtgc cacagacaaa gcagcaagaa aaacatacac 300 ttttgatatg gtttttggcc cttctgccaa acaaattgaa gtttatagga gtgtggtttg 360 ccccatatta gatgaagtta tcatgggcta taactgcaca atctttgcat atgggcaaac 420 cggaacaggc aaaaccttca caatggaagg tgaccgatca cccaatgagg agtttacttg 480 tgaagaggac cctctggctg gaataattgc caggactctt catcagatct ttgaaaaact 540 gtctaataat ggcacagagt tctcagtgaa ggtgtctctg ctggaaattt ataacgagga 600 actgtttgac ctgctcagcc ctgctgctga tgtcacagag agattacagc tcgtgtgtga 660 tcctagaaat aaaaagggtg tgaccattaa aggtctggag gaaatcaccg cacacaataa 720 gaatgaggtg tatcagatcc tggagagagg agcagccaag aggaagactg cctccacact 780 catgaatgcc tattccagtc gatcccactc tgtgttctct gtcactattc acatgaagga 840 aatcacactg gatggggaag agctagttaa gattggtaaa ctcaacttgg tggatcttgc 900 aggtagtgag aacatcggac gatctggtgc tgtggataag cgtgcacgtg aagctggcaa 960 catcaaccag tctctgctaa ctctgggtcg tgtaatcaag gctttggtgg agagagggcc 1020 tcacgtgccc tacagagagt ctaaactcac ccgcatactg caggactcgc taggaggacg 1080 caccaaaacc tctatcatcg ccactgtgtc tcctgcctcc atcaatctgg aggaaactct 1140 gagcacttta gactatgcta acagagctaa gagcatcatg aataagcctg aggtcaatca 1200 aaggcttacc aagagaaccc tcatcaagga atacacagag gaaattgaga gactgaagag 1260 agatttggct gccacccgtg acaaacatgg agtgtacctt tccgttgata actatgagac 1320 cttgaatggt aaaattgtgt ctcaggaaga gcagattaca gagtacactg agcggattgc 1380 tgctacggag gaggagctca aaaagattat agacctgttc acagatagta agcagaaatt 1440 ggaacagtgc actgaggacc tgcaagacaa aaaccagaga ctggaggagg ctcacaaaga 1500 cctatcggag accaggcgcc gcctcaatca ggaggaattc atcagtacac agcttcaaac 1560 caatgagagt cacctctata acactgctga ccagttgttg agcactgctg aggccagcac 1620 acaggatgtt ggtggtctcc atgctaagct acaaaggaag aaggatgtgg agcttcataa 1680 cagtaagggt cacgagagct tctcccagtg catggagaac tgctacaaca gcatgcagac 1740 ctcacttgaa ggagcagaac ccaaaacatg ctgctatgat tgattattac cgctcctctg 1800 tgggtgagct gcttgaaccc aatggcaagg aaggagcaga gcccaaaacc atgctgttat 1860 gattgattat tcccgctcct ctgtgggtga gctgttgaac accaatggca aggtgtttaa 1920 gaagactttg ggtgctgtgt gcgagtctta cagcagtatt aaaggagctg tggaagaagg 1980 agtagagcgg tgtaaggagc aagtgttaaa tcaggagaaa ctctctcagg atgctcaaaa 2040 cagcatcctg gaaattctgg atgaacacaa acagcatctt gaggaggttc tggttgccca 2100 ggcggtgcca ggtatcaggt ctgtcatgtc catgaatgac aatttgaagc aaacccttca 2160 caaataccac aatctggctg agcagatgca gggtgtgaaa gcagatatga tgacgttttt 2220 tgatgcgtac actgaatcac tagccagtat gcgagagtgc gctttgcagg gttttgacac 2280 actgcgtgcc gaacatgaca aactcaaaca acaaatcagc caagctggaa acagccatca 2340 agtgcgtgtg gccgagctgg ttcagtgttt gcagaaccaa atgaatctgc tggcggtgga 2400 cactcagaat gactttgagg gtctttcaca agcagcatct gcccagattc ctccacttgg 2460 aaacattaca gagctccata gagagtaaat gcacagtggc tgaggagcag gctgtgtctg 2520 ttcgtgctcg actgggttcc tctgttcatg gagtgatttc agagatgaat agtgtgacta 2580 aggagggaga gagggcgctg gaggagtgtg ctggttattg tggacacctg cagacatcgc 2640 tggactcgct ggctgagtca ggactcaagt ggtgcgatga agctaaaggc ttgactgaga 2700 gtaaagccca ggagcaactc aaactcatca gacagacaga cacggctgtg caagacctgc 2760 tgaagtctgt ggaggaaaag ggtgagaaag ctgttcagga ctgtgaagcc agattgggtc 2820 aaatgcagca ggaaatggaa gcaaccctgg gtcgtgtgga aatgcagacc agcaaagatg 2880 aagcgacatt gcaggagcac agagagaccc tgtcctccat caacactcag gcactggaca 2940 ccgtacacaa cttcatcagc tcagagctac ggcaggattt accaacagga acgacccctc 3000 agcgcaaaga gtacatgtac ccacgtgtgc tgagcaggcc gaggagtaga gagtagctgg 3060 aagaggagtt cagagctcag caagagcagc ttcagtctga actgaagccg tgtgagatcg 3120 taatggaggt ggaggaggag aaacctgtcg accaggactc tctggaggat gacgtcagtg 3180 tttctagtga tggaaacaac actgaacagt cgtgctctga tgagaacctg atatgttatg 3240 agaatggaag gattcctttt ttcaagaaga aaagcaagaa ggaaaatggc agtaaatcac 3300 tgaaccgttc aaaggtggag aatgacagca tgtcaacacc accccgctct aaacttccac 3360 tcagatgtca gaattaacct aaattatgtt gccaatatta ctcccttcgt gtattaatct 3420 tccaagtcct tttgtgtctt aaacgttgca ttttttaact acgcttgtct tttttaaaca 3480 tcgattatca atgtatttgt cttatgtctg tccacacaag tcagttataa cttcagggtg 3540 atatcgtact gttacttttc ctgtttcact agcgtttagg cttgtttgta tttgttcaag 3600 gttttgtatg tttaattaaa gactaaaaga aaatggtcca ctccatctct caaataatat 3660 aa 3662 58 983 PRT Danio rerio 58 Met Ala Ser Ser Gln Val Pro Ala Ala Lys Lys Asp Glu Lys Gly Arg 1 5 10 15 Asn Ile Gln Val Val Val Arg Cys Arg Pro Phe Asn Thr Val Glu Arg 20 25 30 Lys Ser Gly Ser His Thr Val Val Glu Cys Asp Gln Asn Arg Lys Glu 35 40 45 Val Ile Met Arg Thr Gly Gly Ala Thr Asp Lys Ala Ala Arg Lys Thr 50 55 60 Tyr Thr Phe Asp Met Val Phe Gly Pro Ser Ala Lys Gln Ile Glu Val 65 70 75 80 Tyr Arg Ser Val Val Cys Pro Ile Leu Asp Glu Val Ile Met Gly Tyr 85 90 95 Asn Cys Thr Ile Phe Ala Tyr Gly Gln Thr Gly Thr Gly Lys Thr Phe 100 105 110 Thr Met Glu Gly Asp Arg Ser Pro Asn Glu Glu Phe Thr Cys Glu Glu 115 120 125 Asp Pro Leu Ala Gly Ile Ile Ala Arg Thr Leu His Gln Ile Phe Glu 130 135 140 Lys Leu Ser Asn Asn Gly Thr Glu Phe Ser Val Lys Val Ser Leu Leu 145 150 155 160 Glu Ile Tyr Asn Glu Glu Leu Phe Asp Leu Leu Ser Pro Ala Ala Asp 165 170 175 Val Thr Glu Arg Leu Gln Leu Val Cys Asp Pro Arg Asn Lys Lys Gly 180 185 190 Val Thr Ile Lys Gly Leu Glu Glu Ile Thr Ala His Asn Lys Asn Glu 195 200 205 Val Tyr Gln Ile Leu Glu Arg Gly Ala Ala Lys Arg Lys Thr Ala Ser 210 215 220 Thr Leu Met Asn Ala Tyr Ser Ser Arg Ser His Ser Val Phe Ser Val 225 230 235 240 Thr Ile His Met Lys Glu Ile Thr Leu Asp Gly Glu Glu Leu Val Lys 245 250 255 Ile Gly Lys Leu Asn Leu Val Asp Leu Ala Gly Ser Glu Asn Ile Gly 260 265 270 Arg Ser Gly Ala Val Asp Lys Arg Ala Arg Glu Ala Gly Asn Ile Asn 275 280 285 Gln Ser Leu Leu Thr Leu Gly Arg Val Ile Lys Ala Leu Val Glu Arg 290 295 300 Gly Pro His Val Pro Tyr Arg Glu Ser Lys Leu Thr Arg Ile Leu Gln 305 310 315 320 Asp Ser Leu Gly Gly Arg Thr Lys Thr Ser Ile Ile Ala Thr Val Ser 325 330 335 Pro Ala Ser Ile Asn Leu Glu Glu Thr Leu Ser Thr Leu Asp Tyr Ala 340 345 350 Asn Arg Ala Lys Ser Ile Met Asn Lys Pro Glu Val Asn Gln Arg Leu 355 360 365 Thr Lys Arg Thr Leu Ile Lys Glu Tyr Thr Glu Glu Ile Glu Arg Leu 370 375 380 Lys Arg Asp Leu Ala Ala Thr Arg Asp Lys His Gly Val Tyr Leu Ser 385 390 395 400 Val Asp Asn Tyr Glu Thr Leu Asn Gly Lys Ile Val Ser Gln Glu Glu 405 410 415 Gln Ile Thr Glu Tyr Thr Glu Arg Ile Ala Ala Thr Glu Glu Glu Leu 420 425 430 Lys Lys Ile Ile Asp Leu Phe Thr Asp Ser Lys Gln Lys Leu Glu Gln 435 440 445 Cys Thr Glu Asp Leu Gln Asp Lys Asn Gln Arg Leu Glu Glu Ala His 450 455 460 Lys Asp Leu Ser Glu Thr Arg Arg Arg Leu Asn Gln Glu Glu Phe Ile 465 470 475 480 Ser Thr Gln Leu Gln Thr Asn Glu Ser His Leu Tyr Asn Thr Ala Asp 485 490 495 Gln Leu Leu Ser Thr Ala Glu Ala Ser Thr Gln Asp Val Gly Gly Leu 500 505 510 His Ala Lys Leu Gln Arg Lys Lys Asp Val Glu Leu His Asn Ser Lys 515 520 525 Gly His Glu Ser Phe Ser Gln Cys Met Glu Asn Cys Tyr Asn Ser Met 530 535 540 Gln Thr Ser Leu Glu Gly Ala Glu Pro Lys Thr Cys Cys Tyr Asp Met 545 550 555 560 Ile Asp Tyr Ser Arg Ser Ser Val Gly Glu Leu Leu Asn Thr Asn Gly 565 570 575 Lys Val Phe Lys Lys Thr Leu Gly Ala Val Cys Glu Ser Tyr Ser Ser 580 585 590 Ile Lys Gly Ala Val Glu Glu Gly Val Glu Arg Cys Lys Glu Gln Val 595 600 605 Leu Asn Gln Glu Lys Leu Ser Gln Asp Ala Gln Asn Ser Ile Leu Glu 610 615 620 Ile Leu Asp Glu His Lys Gln His Leu Glu Glu Val Leu Val Ala Gln 625 630 635 640 Ala Val Pro Gly Ile Arg Ser Val Met Ser Met Asn Asp Asn Leu Lys 645 650 655 Gln Thr Leu His Lys Tyr His Asn Leu Ala Glu Gln Met Gln Gly Val 660 665 670 Lys Ala Asp Met Met Thr Phe Phe Asp Ala Tyr Thr Glu Ser Leu Ala 675 680 685 Ser Met Arg Glu Cys Ala Leu Gln Gly Phe Asp Thr Leu Arg Ala Glu 690 695 700 His Asp Lys Leu Lys Gln Gln Ile Ser Gln Ala Gly Asn Ser His Gln 705 710 715 720 Val Arg Val Ala Glu Leu Val Gln Cys Leu Gln Asn Gln Met Asn Leu 725 730 735 Leu Ala Val Asp Thr Gln Asn Asp Phe Glu Gly Leu Ser Gln Ala Ala 740 745 750 Ser Ala Gln Ile Pro Pro Leu Gly Asn Ile Thr Glu Leu His Arg Glu 755 760 765 Met Thr Leu Arg Val Phe His Lys Gln His Leu Pro Arg Phe Leu His 770 775 780 Leu Glu Thr Leu Gln Ser Ser Ile Glu Ser Lys Cys Thr Val Ala Glu 785 790 795 800 Glu Gln Ala Val Ser Val Arg Ala Arg Leu Gly Ser Ser Val His Gly 805 810 815 Val Ile Ser Glu Met Asn Ser Val Thr Lys Glu Gly Glu Arg Ala Leu 820 825 830 Glu Glu Cys Ala Gly Tyr Cys Gly His Leu Gln Thr Ser Leu Asp Ser 835 840 845 Leu Ala Glu Ser Gly Leu Lys Trp Cys Asp Glu Ala Lys Gly Leu Thr 850 855 860 Glu Ser Lys Ala Gln Glu Gln Leu Lys Leu Ile Arg Gln Thr Asp Thr 865 870 875 880 Ala Val Gln Asp Leu Leu Lys Ser Val Glu Glu Lys Gly Glu Lys Ala 885 890 895 Val Gln Asp Cys Glu Ala Arg Leu Gly Gln Met Gln Gln Glu Met Glu 900 905 910 Ala Thr Leu Gly Arg Val Glu Met Gln Thr Ser Lys Asp Glu Ala Thr 915 920 925 Leu Gln Glu His Arg Glu Thr Leu Ser Ser Ile Asn Thr Gln Ala Leu 930 935 940 Asp Thr Val His Asn Phe Ile Ser Ser Glu Leu Arg Gln Asp Leu Pro 945 950 955 960 Thr Gly Thr Thr Pro Gln Arg Lys Glu Tyr Met Tyr Pro Arg Val Leu 965 970 975 Ser Arg Pro Arg Ser Arg Glu 980 59 2808 DNA Danio rerio 59 ccacgcgtcc gaaacaaagc ggattgctag cgagctaaca gttaacgtta ccccagtagt 60 atctgtgcac acaggggcat aaaattaaga gtgcgattag gggatgtttt aaaaaccgtg 120 cgcacagtca gggtttaatt agatgtttct ttctgcatta tgtgcaattt tatagtttta 180 atgcatcgta tgtaatagct aactggttag ccctcagtga gacagctttg attcgaggga 240 gcaaaatgaa actaaactat aatggtccgc gttgtaagtg atagttcaat aattaagcga 300 tgtgacgtta tcttagctga ccagctttgt tttattttga cgttattatc atgtaataac 360 caaacaaaat gaatgtacac atctttacga gtcactcaga tagtcaaata atagcgtgtt 420 agtagataac gggttgatta aacatacaaa gaggaggcgt taagtgctaa cgttagtaat 480 cagctaactt aaaacgactg tcatgaaaaa ataaacccca gtgaaagcat ttagggggga 540 aatgttcaat ctgatggcga actgttgcaa ctggttgaaa cgatggagag aacctgcaag 600 gaaggtgaca ttagtaatgg tgggattgga caatgctgga aaaacggcga cagtcagagg 660 aatccaggga gaaagccctt tagatgtagc acctacagtg ggcttttcaa aggtggacct 720 gaagcaggga aagtttgagg tcacaatctt tgacctgggt ggcgggaagc gtattcgtgg 780 catttggaaa aactactact ctgaatctta tggagtggtg tttgttgtgg actccagtga 840 tgtccagaga atccaggaga ccagagacac catggctgag gttctccggc atcctcgtat 900 tgcaggcaaa cctgtcttag tactggctaa taaacaggac caagatgggg caatggctga 960 agcagatatc attgagaccc tgtcattgga gaagctagtc aacgagaata aatgcctttg 1020 ccagattgag ccctgttcag ctgtcttggg ttatggtaaa aaggttgaca aatccatcaa 1080 aaatggcctt aactggctcc taaataacat tgccaaggac tatgaggcaa tttcagaacg 1140 tgtgcaaaaa gatacagccg aacagaaggc tcaggaagaa caagacaaga aagaaagagc 1200 agagagagtt cggcggatca gagaagagag agaccgacag gaacgggagg aagcagagcg 1260 agaaggaagg acattaaaag aggaagagct agatgatgtc aacatgttca acccctttca 1320 accaataaac aatgttttga ctgagaacca agacagacta aacagagaaa aagagatgca 1380 aagacagagg gaaaatggcc agcagggcag tgtgcaagaa cagatagcac ttcaagatga 1440 ggaggaagaa gaggaagatg aagagagtga gagacagact ccagaaagca cagaatcagg 1500 agcagtggac cagaccaaaa agaaaactag gaaactacgg ttgaaacgca aacacagagt 1560 cgaccctctt aggatggaag aagcagcacc caaaagcccc acgcctcctc ctctgccagt 1620 tggatgggca actcctaaag tttctaggct ccccaaactt gagcctcttg gagatacaag 1680 gcattctgat ttctatggca aacctctccc acctgtagca attcggcaga ggccgaacag 1740 cgatactcat gatgtcattt cctaacacta cctctacttc cccaaccccc ccttcactgg 1800 tttacagttc ttaaggttcc ctgccaatca aagtattgtt ttgttcttat cttcaagcac 1860 gggcagtgta gcaagctacg cagatgtaat tgcccataag agaggaactt gatacccaat 1920 ctgcattccc agatggatca caagaactcc agcaaccagt gactgatagt cagcagatgg 1980 acattcactt catgacatgt tcagaaaacg tagaatggtt caatatttag gattttgttg 2040 agctgtttgt tttatccaaa tactttaaat cagtattaac agacttttga aattaagtat 2100 gtttttaatg cgttgcaaaa atgcattaca gacctatttt ttaagataac tgaaaaatat 2160 gtacatacaa aaagtttaaa tttgtgtact tgaaattgtt cttctggtcc acatgcatac 2220 cttattattc tgcgcatgta atgtgcaatc gattgtgaca cagtttgcct ttaaatgatc 2280 agaaatgttt cttttttctc tctcagcata tagtttcata cactgtgctg tcgtaatttt 2340 gagaatgaca ttcatactat ctagtagcac cagcaacaga taccgagatc ttgataaagt 2400 gaactttttg catgagatgt gaaacatttt tctacaagct cgttattgag ttttattgtc 2460 acatttggta catcatgctg tatcagggag gaaatggtca ttgaacgata atttcatcaa 2520 ttactgattt aaatatattc cactgtagtt gtaacacggc tcaacctgta ttacgtatac 2580 tgttggacgc tgctttcaga tttattttat gctgttacag ttttttttta ttgtttcagt 2640 ttgttggatg ttttgtattt tacaccacac actaatgttt agttgatgaa aattctgatg 2700 tcatccattt tgctctgctt taagtacttg cgccttttgt actcggttta catttaatct 2760 aactgtcaaa taaataaaag atatttttac tctgcaaaaa aaaaaaaa 2808 60 407 PRT Danio rerio 60 Met Phe Asn Leu Met Ala Asn Cys Cys Asn Trp Leu Lys Arg Trp Arg 1 5 10 15 Glu Pro Ala Arg Lys Val Thr Leu Val Met Val Gly Leu Asp Asn Ala 20 25 30 Gly Lys Thr Ala Thr Val Arg Gly Ile Gln Gly Glu Ser Pro Leu Asp 35 40 45 Val Ala Pro Thr Val Gly Phe Ser Lys Val Asp Leu Lys Gln Gly Lys 50 55 60 Phe Glu Val Thr Ile Phe Asp Leu Gly Gly Gly Lys Arg Ile Arg Gly 65 70 75 80 Ile Trp Lys Asn Tyr Tyr Ser Glu Ser Tyr Gly Val Val Phe Val Val 85 90 95 Asp Ser Ser Asp Val Gln Arg Ile Gln Glu Thr Arg Asp Thr Met Ala 100 105 110 Glu Val Leu Arg His Pro Arg Ile Ala Gly Lys Pro Val Leu Val Leu 115 120 125 Ala Asn Lys Gln Asp Gln Asp Gly Ala Met Ala Glu Ala Asp Ile Ile 130 135 140 Glu Thr Leu Ser Leu Glu Lys Leu Val Asn Glu Asn Lys Cys Leu Cys 145 150 155 160 Gln Ile Glu Pro Cys Ser Ala Val Leu Gly Tyr Gly Lys Lys Val Asp 165 170 175 Lys Ser Ile Lys Asn Gly Leu Asn Trp Leu Leu Asn Asn Ile Ala Lys 180 185 190 Asp Tyr Glu Ala Ile Ser Glu Arg Val Gln Lys Asp Thr Ala Glu Gln 195 200 205 Lys Ala Gln Glu Glu Gln Asp Lys Lys Glu Arg Ala Glu Arg Val Arg 210 215 220 Arg Ile Arg Glu Glu Arg Asp Arg Gln Glu Arg Glu Glu Ala Glu Arg 225 230 235 240 Glu Gly Arg Thr Leu Lys Glu Glu Glu Leu Asp Asp Val Asn Met Phe 245 250 255 Asn Pro Phe Gln Pro Ile Asn Asn Val Leu Thr Glu Asn Gln Asp Arg 260 265 270 Leu Asn Arg Glu Lys Glu Met Gln Arg Gln Arg Glu Asn Gly Gln Gln 275 280 285 Gly Ser Val Gln Glu Gln Ile Ala Leu Gln Asp Glu Glu Glu Glu Glu 290 295 300 Glu Asp Glu Glu Ser Glu Arg Gln Thr Pro Glu Ser Thr Glu Ser Gly 305 310 315 320 Ala Val Asp Gln Thr Lys Lys Lys Thr Arg Lys Leu Arg Leu Lys Arg 325 330 335 Lys His Arg Val Asp Pro Leu Arg Met Glu Glu Ala Ala Pro Lys Ser 340 345 350 Pro Thr Pro Pro Pro Leu Pro Val Gly Trp Ala Thr Pro Lys Val Ser 355 360 365 Arg Leu Pro Lys Leu Glu Pro Leu Gly Asp Thr Arg His Ser Asp Phe 370 375 380 Tyr Gly Lys Pro Leu Pro Pro Val Ala Ile Arg Gln Arg Pro Asn Ser 385 390 395 400 Asp Thr His Asp Val Ile Ser 405 61 1730 DNA Danio rerio 61 cctaacaagt gtgcaaagcc atctttccct gaagtctttc ctattgacgg acaagctgtt 60 caaccaataa agcacaggtt tttgctcggg atacatcttt cttccatttc gacaatcaac 120 ttttatccaa cttccaagaa cttccatctg aggaggcgct ttgcatttca atttcttgga 180 ggaggaagga tggatgtgag aatgaaccaa ggacacctac ttctggcagt gaccctcatc 240 gtctgcaact cacagctgct ggtggtcgcc aactcgtggt ggtcattagc catgaacccc 300 atccagagac cggagatgta catcattgga gcacagcctc tgtgcagcca gctgacgggc 360 ctatctcagg gtcagaggaa gctctgccag ctctatcagg accacatggt ttatattgga 420 gagggggcga agacgggcat caaagagtgc cagtatcagt tcagacagag gcgatggaac 480 tgcagtacag tggacaacac gtcagtgttc ggccgcgtca tgcatatagg cagccgagaa 540 acagctttta cgtacgccgt cagcgcagcg ggtgttgtga atgctgtgag tcgagcgtgc 600 cgtgagggtg agctttccac ctgcggctgc agtcgagcgg ctcgtcccag agacctgccc 660 agagactggc tgtggggcgg ctgcggggac aacgtcaact atggctaccg cttcgcccgg 720 gagtttgtgg acgctcgtga acgtgagaag aactacccac gcggatcggt ggaacacgca 780 cgcacgctta tgaatctgca gaacaatgaa gccggaagaa tggcggtgta taatctagcg 840 aatgtggcct gcaagtgtca tggcgtctca ggctcgtgca gcttgaaaac ctgctggctc 900 cagctggccg acttccggcg tgttggagaa ttcctgaagg agaaatacga cagcgccgcc 960 gccatgcgca ttaaccgacg tggaaaacta gagctggtca ataatcgatt caacccaccg 1020 acaggtgaag atctggtcta catcgacccc agcccggatt actgcctgcg caatgaaacc 1080 actgggtctc tgggcaccca aggccgccta tgcaacaaga cctcggaggg tatggacggc 1140 tgcgagctca tgtgctgcgg ccgcgggtac gaccagttca agacctacaa acatgagcgc 1200 tgccactgca agttccactg gtgctgctat gtcaagtgca aacgctgcac gtcactcgta 1260 gaccagtttg tgtgcaagta gcagacgtga gaactggggg acagacgcac tgagcaatta 1320 agcaggagaa gaaacgggac ccttacggac ccagaggggc aacttagaga taattaaatg 1380 taaaaatgat atattaaata gcaacaaatt aaaagtatat aaataagtgt acgtgtgccg 1440 ttgatagtaa tttaatgacc tgaggatcca cgttggacgt cattgatgaa aaaggaggca 1500 ccggtgtgat gcattccggc tcaagctgtg tttctcttat atagagagga aggaactctt 1560 gacttgtgct acaagaaact ctttagagac tcgaggaaaa ggcgcagaaa taggatgggg 1620 aacatcaagg gcgcatcacc cacccattgc cttccaattc ttaaacacac acacacacac 1680 acacactcca ccttgctgat gtcagagctc ttaagaactt tcggaatgcg 1730 62 363 PRT Danio rerio 62 Met Asp Val Arg Met Asn Gln Gly His Leu Leu Leu Ala Val Thr Leu 1 5 10 15 Ile Val Cys Asn Ser Gln Leu Leu Val Val Ala Asn Ser Trp Trp Ser 20 25 30 Leu Ala Met Asn Pro Ile Gln Arg Pro Glu Met Tyr Ile Ile Gly Ala 35 40 45 Gln Pro Leu Cys Ser Gln Leu Thr Gly Leu Ser Gln Gly Gln Arg Lys 50 55 60 Leu Cys Gln Leu Tyr Gln Asp His Met Val Tyr Ile Gly Glu Gly Ala 65 70 75 80 Lys Thr Gly Ile Lys Glu Cys Gln Tyr Gln Phe Arg Gln Arg Arg Trp 85 90 95 Asn Cys Ser Thr Val Asp Asn Thr Ser Val Phe Gly Arg Val Met His 100 105 110 Ile Gly Ser Arg Glu Thr Ala Phe Thr Tyr Ala Val Ser Ala Ala Gly 115 120 125 Val Val Asn Ala Val Ser Arg Ala Cys Arg Glu Gly Glu Leu Ser Thr 130 135 140 Cys Gly Cys Ser Arg Ala Ala Arg Pro Arg Asp Leu Pro Arg Asp Trp 145 150 155 160 Leu Trp Gly Gly Cys Gly Asp Asn Val Asn Tyr Gly Tyr Arg Phe Ala 165 170 175 Arg Glu Phe Val Asp Ala Arg Glu Arg Glu Lys Asn Tyr Pro Arg Gly 180 185 190 Ser Val Glu His Ala Arg Thr Leu Met Asn Leu Gln Asn Asn Glu Ala 195 200 205 Gly Arg Met Ala Val Tyr Asn Leu Ala Asn Val Ala Cys Lys Cys His 210 215 220 Gly Val Ser Gly Ser Cys Ser Leu Lys Thr Cys Trp Leu Gln Leu Ala 225 230 235 240 Asp Phe Arg Arg Val Gly Glu Phe Leu Lys Glu Lys Tyr Asp Ser Ala 245 250 255 Ala Ala Met Arg Ile Asn Arg Arg Gly Lys Leu Glu Leu Val Asn Asn 260 265 270 Arg Phe Asn Pro Pro Thr Gly Glu Asp Leu Val Tyr Ile Asp Pro Ser 275 280 285 Pro Asp Tyr Cys Leu Arg Asn Glu Thr Thr Gly Ser Leu Gly Thr Gln 290 295 300 Gly Arg Leu Cys Asn Lys Thr Ser Glu Gly Met Asp Gly Cys Glu Leu 305 310 315 320 Met Cys Cys Gly Arg Gly Tyr Asp Gln Phe Lys Thr Tyr Lys His Glu 325 330 335 Arg Cys His Cys Lys Phe His Trp Cys Cys Tyr Val Lys Cys Lys Arg 340 345 350 Cys Thr Ser Leu Val Asp Gln Phe Val Cys Lys 355 360 63 1624 DNA Danio rerio 63 ggcacgagct tgcgtttgtc agaccgcggc gaatctgctc cacgcattta cagaaaccga 60 gcaaaatcgc atccaagcat cgctctatcg ccgtcacatc ttcttcacat gcggggctga 120 actgagtttg cgcgtttgag acaccccgga gaggattgcg tgcggcgtta tattcgtatt 180 gtgcgtcatt tctgcatttc tgcgtttcca cccaggcaat atggcaacac cagccgctgt 240 aaacccatcg gaaatgggca cggatttgcc ggggccggtg tctatgccag gtgctgttgt 300 tggtgctggt caggtcagga tgacgggtgc aatgccagga cgaggaggca agcggagatc 360 agcggggatg gattttgacg acgaggacgg cgagggaccc agcaaatttt caaggtatga 420 tgatgatcag attcctggtg ataaggagcg ttacgccaga gagaatcaca gcgagatcga 480 gcggcgcaga cgaaataaaa tgacgcagta catcaccgag ctgtcggaca tggtacccac 540 ctgcagcgcc ctggccagaa aaccagacaa actcaccatc ctgcgcatgg ccgtctctca 600 catgaagtcc atgaggggaa ccggaaacac ctcaaccgac ggcgcctaca aaccctcctt 660 cctcactgaa caggagctga agcacttgat tctggaggcg gccgatggct tcttgttcgt 720 ggttgcggct gagacgggac gggtgatcta cgtgtctgat tcggtgactc ctgtcctcaa 780 ccatccgcag tcagagtggt tcggtagcac actgttcgag caggtccatc cagacgacgt 840 tgacaaactg agggagcagc tgagcacgtc tgagaattca atgacaggtc gtattctgga 900 cctgaagacg gggacagtga agaaggaagg tcaacagtcc tctatgagga tgtgtatggg 960 ctccaggcgc tccttcatct gcagaatgag gtgtggcagt gcacctttgg atcacatctc 1020 attgaatcga ttgtccagca tgagaaagag atacaggaat ggtctcggtc cgtctaagga 1080 gggtgaagcg cagtactctg ttgttcattg taccggctac atcaaggcct ggccgccagc 1140 gggcatgact attccagatg aagacacaga agcaggtcaa actagcaaat actgcttggt 1200 tgctattgga agactgcagg tgaccagctc tcctgtttca atggatatga acggactttc 1260 agttccgact gaattccttt cacgccacaa ttcggatggc atcatcacat tcgtggaccc 1320 tcgatgcatc aacgtgattg gctaccaacc tcaggttctt cttggcaaag acattctgga 1380 gttctgccac ccggaggacc agagccacct tcgggagagt ttccagcagg tacgccaaaa 1440 ccagattaaa cccgtcagcc aacaccaatc acatatttcc cacatgatgt ttttttaaca 1500 gatcaaggaa tatttcacag tgtttcctat aatatatatt ttttcttctg gagagggtct 1560 tatttgtttt atttcggcca gaataaaagc agtttttaat aaaaaaaaaa aaaaaaaaaa 1620 aaaa 1624 64 425 PRT Danio rerio 64 Met Ala Thr Pro Ala Ala Val Asn Pro Ser Glu Met Gly Thr Asp Leu 1 5 10 15 Pro Gly Pro Val Ser Met Pro Gly Ala Val Val Gly Ala Gly Gln Val 20 25 30 Arg Met Thr Gly Ala Met Pro Gly Arg Gly Gly Lys Arg Arg Ser Ala 35 40 45 Gly Met Asp Phe Asp Asp Glu Asp Gly Glu Gly Pro Ser Lys Phe Ser 50 55 60 Arg Tyr Asp Asp Asp Gln Ile Pro Gly Asp Lys Glu Arg Tyr Ala Arg 65 70 75 80 Glu Asn His Ser Glu Ile Glu Arg Arg Arg Arg Asn Lys Met Thr Gln 85 90 95 Tyr Ile Thr Glu Leu Ser Asp Met Val Pro Thr Cys Ser Ala Leu Ala 100 105 110 Arg Lys Pro Asp Lys Leu Thr Ile Leu Arg Met Ala Val Ser His Met 115 120 125 Lys Ser Met Arg Gly Thr Gly Asn Thr Ser Thr Asp Gly Ala Tyr Lys 130 135 140 Pro Ser Phe Leu Thr Glu Gln Glu Leu Lys His Leu Ile Leu Glu Ala 145 150 155 160 Ala Asp Gly Phe Leu Phe Val Val Ala Ala Glu Thr Gly Arg Val Ile 165 170 175 Tyr Val Ser Asp Ser Val Thr Pro Val Leu Asn His Pro Gln Ser Glu 180 185 190 Trp Phe Gly Ser Thr Leu Phe Glu Gln Val His Pro Asp Asp Val Asp 195 200 205 Lys Leu Arg Glu Gln Leu Ser Thr Ser Glu Asn Ser Met Thr Gly Arg 210 215 220 Ile Leu Asp Leu Lys Thr Gly Thr Val Lys Lys Glu Gly Gln Gln Ser 225 230 235 240 Ser Met Arg Met Cys Met Gly Ser Arg Arg Ser Phe Ile Cys Arg Met 245 250 255 Arg Cys Gly Ser Ala Pro Leu Asp His Ile Ser Leu Asn Arg Leu Ser 260 265 270 Ser Met Arg Lys Arg Tyr Arg Asn Gly Leu Gly Pro Ser Lys Glu Gly 275 280 285 Glu Ala Gln Tyr Ser Val Val His Cys Thr Gly Tyr Ile Lys Ala Trp 290 295 300 Pro Pro Ala Gly Met Thr Ile Pro Asp Glu Asp Thr Glu Ala Gly Gln 305 310 315 320 Thr Ser Lys Tyr Cys Leu Val Ala Ile Gly Arg Leu Gln Val Thr Ser 325 330 335 Ser Pro Val Ser Met Asp Met Asn Gly Leu Ser Val Pro Thr Glu Phe 340 345 350 Leu Ser Arg His Asn Ser Asp Gly Ile Ile Thr Phe Val Asp Pro Arg 355 360 365 Cys Ile Asn Val Ile Gly Tyr Gln Pro Gln Val Leu Leu Gly Lys Asp 370 375 380 Ile Leu Glu Phe Cys His Pro Glu Asp Gln Ser His Leu Arg Glu Ser 385 390 395 400 Phe Gln Gln Val Arg Gln Asn Gln Ile Lys Pro Val Ser Gln His Gln 405 410 415 Ser His Ile Ser His Met Met Phe Phe 420 425 65 1135 DNA Danio rerio 65 ctaaagggca agcagtggta acaacgcaga gtacgcgggg ggggctcgat gaccgcggct 60 gcagcagcgg acgtgtttga aaacatcagg aggaattgaa ggatgatttg ggcgtcagcg 120 ctgatcttca tcaccgcgct cagctccggt gtgtgtgatg acggacacga gatggaggag 180 ttcctcaaga gggagtacac actgtccaaa ccgtaccagg atgtgggcgt gtcgggctcg 240 tctcattggg agctgatggg agacgctctg gtctcctctg attatgttcg tctgactcca 300 gatcagcaga gcaaacaggg agccatatgg agccgcatgc cttgtcacct gagcgactgg 360 gagctgcagg tccactttag ggttcacggt cagggaaaga agaacctgaa cggtgatggt 420 ctggccgtct ggtacactaa agagcgcatg cagagagggc cggtgtttgg aaacagggac 480 ttcttcacag gccttggtgt gtttgtggac acgtatccca acgaggagaa gcttctggag 540 gctcataaga agcggtatac tccacgcaca cagcgtatat tcccgtatgt gttggctatg 600 gttggcaacg gcagcatcag ctatgatcac gatcgggatg ggcgacccac agagctgggc 660 ggctgcaacg ccatggtgcg aaaccagaaa cacgaaacct tcctcttcat cagatacgtc 720 cggcgccgac tcacggtcat gatggatatc gacgggcagc atgagtggag agactgtctg 780 gatgttcctg gtgttcgtct gccgcagggc ttttacttcg gggcttcagc tgtcacagga 840 gacctttcag ataatcatga cctgatctcc atgaagctct accagctgac gatcctgcgc 900 agcaagcagg aggacgagga gcaggaggag gttctggttc ccagcgtgga caacatcgac 960 ctgctgaggc ctcccgtaga tgaggagagt gtgagcagcg tgggcatctt cttcagcgtg 1020 ctcttcactc tgctgggggt gtttctgctg gtggtggtgg gactggtgct gtatgaacac 1080 tggagcgaga gccgccgcaa gcgcttctac tgacacacac acacacacac acaca 1135 66 336 PRT Danio rerio 66 Met Ile Trp Ala Ser Ala Leu Ile Phe Ile Thr Ala Leu Ser Ser Gly 1 5 10 15 Val Cys Asp Asp Gly His Glu Met Glu Glu Phe Leu Lys Arg Glu Tyr 20 25 30 Thr Leu Ser Lys Pro Tyr Gln Asp Val Gly Val Ser Gly Ser Ser His 35 40 45 Trp Glu Leu Met Gly Asp Ala Leu Val Ser Ser Asp Tyr Val Arg Leu 50 55 60 Thr Pro Asp Gln Gln Ser Lys Gln Gly Ala Ile Trp Ser Arg Met Pro 65 70 75 80 Cys His Leu Ser Asp Trp Glu Leu Gln Val His Phe Arg Val His Gly 85 90 95 Gln Gly Lys Lys Asn Leu Asn Gly Asp Gly Leu Ala Val Trp Tyr Thr 100 105 110 Lys Glu Arg Met Gln Arg Gly Pro Val Phe Gly Asn Arg Asp Phe Phe 115 120 125 Thr Gly Leu Gly Val Phe Val Asp Thr Tyr Pro Asn Glu Glu Lys Leu 130 135 140 Leu Glu Ala His Lys Lys Arg Tyr Thr Pro Arg Thr Gln Arg Ile Phe 145 150 155 160 Pro Tyr Val Leu Ala Met Val Gly Asn Gly Ser Ile Ser Tyr Asp His 165 170 175 Asp Arg Asp Gly Arg Pro Thr Glu Leu Gly Gly Cys Asn Ala Met Val 180 185 190 Arg Asn Gln Lys His Glu Thr Phe Leu Phe Ile Arg Tyr Val Arg Arg 195 200 205 Arg Leu Thr Val Met Met Asp Ile Asp Gly Gln His Glu Trp Arg Asp 210 215 220 Cys Leu Asp Val Pro Gly Val Arg Leu Pro Gln Gly Phe Tyr Phe Gly 225 230 235 240 Ala Ser Ala Val Thr Gly Asp Leu Ser Asp Asn His Asp Leu Ile Ser 245 250 255 Met Lys Leu Tyr Gln Leu Thr Ile Leu Arg Ser Lys Gln Glu Asp Glu 260 265 270 Glu Gln Glu Glu Val Leu Val Pro Ser Val Asp Asn Ile Asp Leu Leu 275 280 285 Arg Pro Pro Val Asp Glu Glu Ser Val Ser Ser Val Gly Ile Phe Phe 290 295 300 Ser Val Leu Phe Thr Leu Leu Gly Val Phe Leu Leu Val Val Val Gly 305 310 315 320 Leu Val Leu Tyr Glu His Trp Ser Glu Ser Arg Arg Lys Arg Phe Tyr 325 330 335 67 2074 DNA Danio rerio 67 tgttctgtga gcgcgtgctg gagctgcggc agacatgaag agaccgaagt tgaagaaggc 60 gagcaagcgc ttatcatgcg ctaaacgcta caaaatacaa aagaaggttc gggagcacaa 120 tcgtaaatta aagaaagccg caaaaaaaca aggcataagt cgaaaggcca aaaaggatat 180 tggagtccca aacagtgcgc cttttaaaga ggaagtgctt cgcgaagctg agcagagaaa 240 acaggagctt gaaactctga aagagcaaaa caagatcgtg aagcagcaag aaaaggctgc 300 caagcggaaa aaggagaaag atgcagccag ttcagtaaaa gagcctgcag ccaagaaagc 360 taaaaaggca gccaaaatca aagaagcaag agcagccata gttaaagtga agagcgctaa 420 aacattcaaa tgccaagaac tcaacaaggt gattgaggcg tctgatgtga tcgtggaggt 480 tttggatgcc agagatcctc tgggctgccg ctgtcctcaa ctggaggaaa tggtcctgaa 540 acacgaggga aagaagaaac tcttgttcat attgaacaaa atagatcttg ttcctaaaga 600 taacctagag aagtggctcc actttcttga agccgagtgt ccaacttttc tgttcaaatc 660 atcgatgcaa ctaaaggaca gaacagtgca acagaagagg cagcagcgag gaactaacgc 720 agttctggat cacagccgag cagcttcctg ttttggaaaa gactttctcc tacagacact 780 taatgacttg gcgaacaaga aagagggcga aaccatgcta aaagttggcg tcgtcggttt 840 tcctaacgtt ggaaagagca gcatcatcaa cagtatgaaa gaaatgcgag cgtgcaatgc 900 tggcgtacag agaggattga ctaggtgtat gcaggaagtg catattacca agaaagtgaa 960 gatgattgac agtcccggga ttttggcggc cctcagtaac cctggaagtg cgatggccct 1020 caggagcctg caggtggagg agaaagagga gagtccgcag gaagccgtca ggaatctact 1080 caaacagtgc aatcagcagc atataatgct acagtataat gtcccagact acagaagctc 1140 cttagagttt ttgaccacct ttgccatgaa gcacgggctc ctgcagaaag gtggcgttgc 1200 agacactgaa ctggcagcca caacattcct caacgactgg acaggggcta agctgagtta 1260 ctacagcaga gttccagagc gacagggtct tccctcttac ctgtccgacg ccatcgtgat 1320 tgaactgcag tcagatgtgg atatggatgc tgtgaagaag gggaacgaga atgtaaagag 1380 aagtgttcgc tttccaaacc tcgcgagctg cataagcttt gactccagtg gtcccactgc 1440 tggagtgctg gatgtcagtg agctgcctaa agagatctta acaaaggcag caacaacaac 1500 agatgcagaa gaagagaaga tggatacgac aaccaacaca gatgagcctg aggccgaaag 1560 ccatatttca tcaaccgtgg aaccgattca ggaaccaaca gaaaagagaa aagataaacc 1620 agcaaaagaa gtgaagtttg taccagtcaa cactgacctg acgtcaatgc agaacaagaa 1680 caatgaggat gcatacgatt tcaacacaga ctttgtgtaa aaattcacta taattattta 1740 catggactga gaggaattaa atagtaaaat gcactcagtt gcagtgcatc tcatgtttgt 1800 atgtaataat acataaaatg tttgtttcac taaatacaag gtaaaactct tacattgctt 1860 acaaaactta tactgagctt tattcaatac ttggttagaa ccattttccc ctttatgtac 1920 aaattagcac actttcccct tcagatttga agctgaaata ttcatgtaac agaaacgtgg 1980 tgtcaaaaat agacatttga actatcacag gtttgtctta aagtcctgaa caagtcgact 2040 acactacctg gcaaaaaaaa aaaaaaaaaa aaaa 2074 68 561 PRT Danio rerio 68 Met Lys Arg Pro Lys Leu Lys Lys Ala Ser Lys Arg Leu Ser Cys Ala 1 5 10 15 Lys Arg Tyr Lys Ile Gln Lys Lys Val Arg Glu His Asn Arg Lys Leu 20 25 30 Lys Lys Ala Ala Lys Lys Gln Gly Ile Ser Arg Lys Ala Lys Lys Asp 35 40 45 Ile Gly Val Pro Asn Ser Ala Pro Phe Lys Glu Glu Val Leu Arg Glu 50 55 60 Ala Glu Gln Arg Lys Gln Glu Leu Glu Thr Leu Lys Glu Gln Asn Lys 65 70 75 80 Ile Val Lys Gln Gln Glu Lys Ala Ala Lys Arg Lys Lys Glu Lys Asp 85 90 95 Ala Ala Ser Ser Val Lys Glu Pro Ala Ala Lys Lys Ala Lys Lys Ala 100 105 110 Ala Lys Ile Lys Glu Ala Arg Ala Ala Ile Val Lys Val Lys Ser Ala 115 120 125 Lys Thr Phe Lys Cys Gln Glu Leu Asn Lys Val Ile Glu Ala Ser Asp 130 135 140 Val Ile Val Glu Val Leu Asp Ala Arg Asp Pro Leu Gly Cys Arg Cys 145 150 155 160 Pro Gln Leu Glu Glu Met Val Leu Lys His Glu Gly Lys Lys Lys Leu 165 170 175 Leu Phe Ile Leu Asn Lys Ile Asp Leu Val Pro Lys Asp Asn Leu Glu 180 185 190 Lys Trp Leu His Phe Leu Glu Ala Glu Cys Pro Thr Phe Leu Phe Lys 195 200 205 Ser Ser Met Gln Leu Lys Asp Arg Thr Val Gln Gln Lys Arg Gln Gln 210 215 220 Arg Gly Thr Asn Ala Val Leu Asp His Ser Arg Ala Ala Ser Cys Phe 225 230 235 240 Gly Lys Asp Phe Leu Leu Gln Thr Leu Asn Asp Leu Ala Asn Lys Lys 245 250 255 Glu Gly Glu Thr Met Leu Lys Val Gly Val Val Gly Phe Pro Asn Val 260 265 270 Gly Lys Ser Ser Ile Ile Asn Ser Met Lys Glu Met Arg Ala Cys Asn 275 280 285 Ala Gly Val Gln Arg Gly Leu Thr Arg Cys Met Gln Glu Val His Ile 290 295 300 Thr Lys Lys Val Lys Met Ile Asp Ser Pro Gly Ile Leu Ala Ala Leu 305 310 315 320 Ser Asn Pro Gly Ser Ala Met Ala Leu Arg Ser Leu Gln Val Glu Glu 325 330 335 Lys Glu Glu Ser Pro Gln Glu Ala Val Arg Asn Leu Leu Lys Gln Cys 340 345 350 Asn Gln Gln His Ile Met Leu Gln Tyr Asn Val Pro Asp Tyr Arg Ser 355 360 365 Ser Leu Glu Phe Leu Thr Thr Phe Ala Met Lys His Gly Leu Leu Gln 370 375 380 Lys Gly Gly Val Ala Asp Thr Glu Leu Ala Ala Thr Thr Phe Leu Asn 385 390 395 400 Asp Trp Thr Gly Ala Lys Leu Ser Tyr Tyr Ser Arg Val Pro Glu Arg 405 410 415 Gln Gly Leu Pro Ser Tyr Leu Ser Asp Ala Ile Val Ile Glu Leu Gln 420 425 430 Ser Asp Val Asp Met Asp Ala Val Lys Lys Gly Asn Glu Asn Val Lys 435 440 445 Arg Ser Val Arg Phe Pro Asn Leu Ala Ser Cys Ile Ser Phe Asp Ser 450 455 460 Ser Gly Pro Thr Ala Gly Val Leu Asp Val Ser Glu Leu Pro Lys Glu 465 470 475 480 Ile Leu Thr Lys Ala Ala Thr Thr Thr Asp Ala Glu Glu Glu Lys Met 485 490 495 Asp Thr Thr Thr Asn Thr Asp Glu Pro Glu Ala Glu Ser His Ile Ser 500 505 510 Ser Thr Val Glu Pro Ile Gln Glu Pro Thr Glu Lys Arg Lys Asp Lys 515 520 525 Pro Ala Lys Glu Val Lys Phe Val Pro Val Asn Thr Asp Leu Thr Ser 530 535 540 Met Gln Asn Lys Asn Asn Glu Asp Ala Tyr Asp Phe Asn Thr Asp Phe 545 550 555 560 Val 69 1584 DNA Danio rerio 69 ttttatgttc ctgtggagtc cggaagccag tagtgtcccg tgttgcaaat cggaatttat 60 aaacagggct ctccttgaga tgttctcagg gtttcaggaa cggtacgacc atgaggacga 120 gctgtacagg gatgaagaag actcatcagg tttgtctgat tcggacagcg agttggaatt 180 ccggcttcat tcccagctgc actacaacgc agagttccag gaaaaccatg aggaatcaag 240 caaagtccaa ccattggtcc cacaaactca acttcagccg gaaaccatgc ctcctgctcc 300 acctgtagat gttatcgtga ttgacttcag ggcagattct tttacagtgt cagacagcac 360 agaggatgat gagagtgtgt gtgctaataa agttcagtca ttaaaacatt gtaaaaggaa 420 gtcacagccc agtcgcttcc gttcaccttc tccgcctcag gttcaagcag gaaggagctc 480 tccggatgat gttgtggtgc tggattctga tttagaacaa tcctcttctg aatctgaacc 540 tccttttgtg gaggatttgg actcggatac agactctgat tcagactctg atggactgga 600 gaactggatg attctgggca aggggagaca agatgaagat cagagcatcc agctgaacct 660 cgcgcagagt agttttgtga atacagaata tgtaaattac agagggcaag agcaatgtgg 720 actgtatcag aaaaggataa acaggctcag atctataata aagggcacgg cccgaggcat 780 gtcccaaacc gctccacata cagatactac actgaaaaat catcacgtgt cgcaactgca 840 ataaactggt catctctcca aaaactgccc cacattaaag aaggtgccgt gctgctcatt 900 atgtgggctt cggggacact tactgaggac ctgtccaaac agtcactgct cgaactgttc 960 gctgcccggc ccacctctga tgactgtctg gagagagcct tttggtacaa aagatgccac 1020 cgctgcggca tgaccggaca cttcatcgac gcctgtccac aaatctggag acagtaccac 1080 cttacaacca ctgcaggacc catccgcaag tctgctgatc caaaggcctg ccagaaacgt 1140 gcctattgtt ataactgctc caggaaagga cactttggcc atcaatgctc tcagagaagg 1200 atgtacaatt ggtcctatcc atccttgcca gttatcacct attatgacac agtgaatgac 1260 atcaaatgcc gggatttccg tctgaaaaag aaggccagag aaatgcagga cgccggactg 1320 atttctcctg atggaggcgt tgtaacattt acccctcaac cacccagaaa gaaacaaaaa 1380 gtcagccaca gtccacatcc gtactcaaac aacacaaaca accaccacac accgaagaaa 1440 cgcatcgcac acacacccaa acacaatcca caaacaaacc aacaaaacac acactggagg 1500 gtcaagcaga actccaaaaa accaggaccc cacggcaaaa acacccctca aagcaaagag 1560 ttcaaatctc aaaataaaga agcc 1584 70 526 PRT Danio rerio 70 Met Phe Leu Trp Ser Pro Glu Ala Ser Ser Val Pro Cys Cys Lys Ser 1 5 10 15 Glu Phe Ile Asn Arg Ala Leu Leu Glu Met Phe Ser Gly Phe Gln Glu 20 25 30 Arg Tyr Asp His Glu Asp Glu Leu Tyr Arg Asp Glu Glu Asp Ser Ser 35 40 45 Gly Leu Ser Asp Ser Asp Ser Glu Leu Glu Phe Arg Leu His Ser Gln 50 55 60 Leu His Tyr Asn Ala Glu Phe Gln Glu Asn His Glu Glu Ser Ser Lys 65 70 75 80 Val Gln Pro Leu Val Pro Gln Thr Gln Leu Gln Pro Glu Thr Met Pro 85 90 95 Pro Ala Pro Pro Val Asp Val Ile Val Ile Asp Phe Arg Ala Asp Ser 100 105 110 Phe Thr Val Ser Asp Ser Thr Glu Asp Asp Glu Ser Val Cys Ala Asn 115 120 125 Lys Val Gln Ser Leu Lys His Cys Lys Arg Lys Ser Gln Pro Ser Arg 130 135 140 Phe Arg Ser Pro Ser Pro Pro Gln Val Gln Ala Gly Arg Ser Ser Pro 145 150 155 160 Asp Asp Val Val Val Leu Asp Ser Asp Leu Glu Gln Ser Ser Ser Glu 165 170 175 Ser Glu Pro Pro Phe Val Glu Asp Leu Asp Ser Asp Thr Asp Ser Asp 180 185 190 Ser Asp Ser Asp Gly Leu Glu Asn Trp Met Ile Leu Gly Lys Gly Arg 195 200 205 Gln Asp Glu Asp Gln Ser Ile Gln Leu Asn Leu Ala Gln Ser Ser Phe 210 215 220 Val Asn Thr Glu Tyr Val Asn Tyr Arg Gly Gln Glu Gln Cys Gly Leu 225 230 235 240 Tyr Gln Lys Arg Ile Asn Arg Leu Arg Ser Ile Ile Lys Gly Thr Ala 245 250 255 Arg Gly Met Ser Gln Thr Ala Pro His Thr Asp Thr Thr Leu Lys Asn 260 265 270 His His Val Ser Gln Leu Gln Asn Trp Ser Ser Leu Gln Lys Leu Pro 275 280 285 His Ile Lys Glu Gly Ala Val Leu Leu Ile Met Trp Ala Ser Gly Thr 290 295 300 Leu Thr Glu Asp Leu Ser Lys Gln Ser Leu Leu Glu Leu Phe Ala Ala 305 310 315 320 Arg Pro Thr Ser Asp Asp Cys Leu Glu Arg Ala Phe Trp Tyr Lys Arg 325 330 335 Cys His Arg Cys Gly Met Thr Gly His Phe Ile Asp Ala Cys Pro Gln 340 345 350 Ile Trp Arg Gln Tyr His Leu Thr Thr Thr Ala Gly Pro Ile Arg Lys 355 360 365 Ser Ala Asp Pro Lys Ala Cys Gln Lys Arg Ala Tyr Cys Tyr Asn Cys 370 375 380 Ser Arg Lys Gly His Phe Gly His Gln Cys Ser Gln Arg Arg Met Tyr 385 390 395 400 Asn Trp Ser Tyr Pro Ser Leu Pro Val Ile Thr Tyr Tyr Asp Thr Val 405 410 415 Asn Asp Ile Lys Cys Arg Asp Phe Arg Leu Lys Lys Lys Ala Arg Glu 420 425 430 Met Gln Asp Ala Gly Leu Ile Ser Pro Asp Gly Gly Val Val Thr Phe 435 440 445 Thr Pro Gln Pro Pro Arg Lys Lys Gln Lys Val Ser His Ser Pro His 450 455 460 Pro Tyr Ser Asn Asn Thr Asn Asn His His Thr Pro Lys Lys Arg Ile 465 470 475 480 Ala His Thr Pro Lys His Asn Pro Gln Thr Asn Gln Gln Asn Thr His 485 490 495 Trp Arg Val Lys Gln Asn Ser Lys Lys Pro Gly Pro His Gly Lys Asn 500 505 510 Thr Pro Gln Ser Lys Glu Phe Lys Ser Gln Asn Lys Glu Ala 515 520 525 71 513 DNA Danio rerio 71 ggaaagttcg gaggtcagcg ctgcgtcgac gcttgctagc ttgttctctg cgctcactct 60 tttataatgg caaaacatca cccagatttg atcttttgca gaaaacaggc tggagttgcc 120 attggaagac tgtgtgaaaa atgtgatggc aaatgcgtga tctgcgactc ttatgtcaga 180 ccgtgtacac tggttcgtat ttgcgatgaa tgcaactatg gctcttacca aggacgctgc 240 gtgatctgtg gaggtcctgg agtgtctgat gcctattact gcaaagagtg caccatacag 300 gagaaagaca gagacggctg tcccaagatc gtaaatctgg gcagctcaaa gactgatctg 360 ttctatgaga ggaaaaaata cggcttcaag aagaggtgaa gaaagaaacg gaaggagttt 420 tttttttgca aaaatctttt tttttgggcc caatttttta ttttggtttt caacctaatt 480 ggaataaaaa aattttttta aaaaaaaaaa aaa 513 72 110 PRT Danio rerio 72 Met Ala Lys His His Pro Asp Leu Ile Phe Cys Arg Lys Gln Ala Gly 1 5 10 15 Val Ala Ile Gly Arg Leu Cys Glu Lys Cys Asp Gly Lys Cys Val Ile 20 25 30 Cys Asp Ser Tyr Val Arg Pro Cys Thr Leu Val Arg Ile Cys Asp Glu 35 40 45 Cys Asn Tyr Gly Ser Tyr Gln Gly Arg Cys Val Ile Cys Gly Gly Pro 50 55 60 Gly Val Ser Asp Ala Tyr Tyr Cys Lys Glu Cys Thr Ile Gln Glu Lys 65 70 75 80 Asp Arg Asp Gly Cys Pro Lys Ile Val Asn Leu Gly Ser Ser Lys Thr 85 90 95 Asp Leu Phe Tyr Glu Arg Lys Lys Tyr Gly Phe Lys Lys Arg 100 105 110 73 2001 DNA Danio rerio 73 accacttctg aacgcgactt gcatcctgca gcgatgactc tttttcacca tgttgttgat 60 cgaggggcta aaaaaagagg acgaaatggt gagcagcggc tgtttcctct gtcgtctctg 120 ctggactgct atgaatgcgc tcggcgggat gaacacatct cttatggagc ctccgctgct 180 gccgtgccgc cattcggatg cacattttcc tccgctcatg gacaacaaaa ctgtcttgct 240 gtagccaacg aagaagggtt tgtgaccatc ttcaacactg gagaaaaaca aagctctgtt 300 ctgaaagaat ggcaggcaca tgataatgca gtttttgata ttgcctgggt tcctggaacc 360 aactgcttag taactgcatc cggtgaccaa actgctcgtc tgtgggatgt tattactggt 420 gacctgctgg gaacatttaa aggacatcaa tgcagtctca aatcagtagc tttttacaag 480 caagagaaag cagtatttag cactggaggc agagatggaa acataatgat ttgggataca 540 agatgcagta aaaaagatgg tttctacagg caagtaaagc agatcagtgg tgctcatatg 600 aaacctgaaa gattcactcc tcaaacaaag aagagacgtg gtatggcgcc ccctgtggat 660 tcccaacaag gtgtgacagt ggttttgttt tgcgatgaga ccaaactcat ctcctcaggg 720 gcggttgatg ggatcattaa aatgtgggat ttgcggagga actatactgc ataccaccag 780 aaccccctgc ctctacaggc ctatccttac ccagggtcct gcacgcgcaa actaggttat 840 tctggtcttt cactggacta cactggatct agactcttct ctaactgcac agatgacaat 900 atctacatgt ttaatatcag tgggctgaaa accactccaa tcgctgtgtt cagtggtcac 960 agtaactcct cattttacgt gaagtccact gtgagtccag acagttacag gtgccgcttt 1020 acccctccca tcaaacacct cttttgcacc tccagccaag ctcaccagcc ccaaaatgcc 1080 atcatccctt caacaatgga tcagccgcag cagcaaatct cctgttcgaa aagccctcac 1140 acctgttctc cagggtctgt ccttcgaacg ccgtgtcaaa cgccgactgg agacgggcga 1200 ttctgccagc tctggtttgg gagaggagat cgacggtgtg tctgaacttt atcccaatgt 1260 caagaggagt aggagttcag taagcacact caagaaagag gactcgtttg gtttggaaag 1320 tgagaagagg caaggctcgg atggtgcaga ggcctctggg aaggagaaca gctctcccag 1380 gaggactgat tggctttctg tgatcagcca gaagtttaaa gggtcggctc agcctaagag 1440 ccccagcagt ggcagcagtc agcaggatac aagaacacta gaatctccag cagcagtctc 1500 acctcgtccc atgaaagttt tctctccacc aacaaacaag aaagcatcac cttccaaacc 1560 tatgaaaaaa atctcaagct actttatgaa aagaacccaa gactgacgtg cgtgggtgca 1620 tgcctgcatg cctgtgtgcg tgcatgttta aagagattac aagtgtcttt cattctacac 1680 atttatggtt taaattcagt ttcagctttt cttgtaactg agcttcaaac caaacctaga 1740 aaagatttgc agattctgcc tgtatttatg tatgtaatgt gtattttttt atctagttgt 1800 gcatttattt ttaaatgtgt aaaaatattt ttctagaaaa gggtcaaatg tatgatcatt 1860 tacagtcatt tgacaccaca gttgaattct aaagtttctc tgataccttc atgtctgtca 1920 gacagtttga aacactttgc tccatttaac atgcttttta aactatatta aattatattt 1980 gggtgttaaa aaaaaaaaaa a 2001 74 522 PRT Danio rerio 74 Met Thr Leu Phe His His Val Val Asp Arg Gly Ala Lys Lys Arg Gly 1 5 10 15 Arg Asn Gly Glu Gln Arg Leu Phe Pro Leu Ser Ser Leu Leu Asp Cys 20 25 30 Tyr Glu Cys Ala Arg Arg Asp Glu His Ile Ser Tyr Gly Ala Ser Ala 35 40 45 Ala Ala Val Pro Pro Phe Gly Cys Thr Phe Ser Ser Ala His Gly Gln 50 55 60 Gln Asn Cys Leu Ala Val Ala Asn Glu Glu Gly Phe Val Thr Ile Phe 65 70 75 80 Asn Thr Gly Glu Lys Gln Ser Ser Val Leu Lys Glu Trp Gln Ala His 85 90 95 Asp Asn Ala Val Phe Asp Ile Ala Trp Val Pro Gly Thr Asn Cys Leu 100 105 110 Val Thr Ala Ser Gly Asp Gln Thr Ala Arg Leu Trp Asp Val Ile Thr 115 120 125 Gly Asp Leu Leu Gly Thr Phe Lys Gly His Gln Cys Ser Leu Lys Ser 130 135 140 Val Ala Phe Tyr Lys Gln Glu Lys Ala Val Phe Ser Thr Gly Gly Arg 145 150 155 160 Asp Gly Asn Ile Met Ile Trp Asp Thr Arg Cys Ser Lys Lys Asp Gly 165 170 175 Phe Tyr Arg Gln Val Lys Gln Ile Ser Gly Ala His Met Lys Pro Glu 180 185 190 Arg Phe Thr Pro Gln Thr Lys Lys Arg Arg Gly Met Ala Pro Pro Val 195 200 205 Asp Ser Gln Gln Gly Val Thr Val Val Leu Phe Cys Asp Glu Thr Lys 210 215 220 Leu Ile Ser Ser Gly Ala Val Asp Gly Ile Ile Lys Met Trp Asp Leu 225 230 235 240 Arg Arg Asn Tyr Thr Ala Tyr His Gln Asn Pro Leu Pro Leu Gln Ala 245 250 255 Tyr Pro Tyr Pro Gly Ser Cys Thr Arg Lys Leu Gly Tyr Ser Gly Leu 260 265 270 Ser Leu Asp Tyr Thr Gly Ser Arg Leu Phe Ser Asn Cys Thr Asp Asp 275 280 285 Asn Ile Tyr Met Phe Asn Ile Ser Gly Leu Lys Thr Thr Pro Ile Ala 290 295 300 Val Phe Ser Gly His Ser Asn Ser Ser Phe Tyr Val Lys Ser Thr Val 305 310 315 320 Ser Pro Asp Ser Tyr Arg Cys Arg Phe Thr Pro Pro Ile Lys His Leu 325 330 335 Phe Cys Thr Ser Ser Gln Ala His Gln Pro Gln Asn Ala Ile Ile Pro 340 345 350 Ser Thr Met Asp Gln Pro Gln Gln Gln Ile Ser Cys Ser Lys Ser Pro 355 360 365 His Thr Cys Ser Pro Gly Ser Val Leu Arg Thr Pro Cys Gln Thr Pro 370 375 380 Thr Gly Asp Gly Arg Phe Cys Gln Leu Trp Phe Gly Arg Gly Asp Arg 385 390 395 400 Arg Cys Val Leu Tyr Pro Asn Val Lys Arg Ser Arg Ser Ser Val Ser 405 410 415 Thr Leu Lys Lys Glu Asp Ser Phe Gly Leu Glu Ser Glu Lys Arg Gln 420 425 430 Gly Ser Asp Gly Ala Glu Ala Ser Gly Lys Glu Asn Ser Ser Pro Arg 435 440 445 Arg Thr Asp Trp Leu Ser Val Ile Ser Gln Lys Phe Lys Gly Ser Ala 450 455 460 Gln Pro Lys Ser Pro Ser Ser Gly Ser Ser Gln Gln Asp Thr Arg Thr 465 470 475 480 Leu Glu Ser Pro Ala Ala Val Ser Pro Arg Pro Met Lys Val Phe Ser 485 490 495 Pro Pro Thr Asn Lys Lys Ala Ser Pro Ser Lys Pro Met Lys Lys Ile 500 505 510 Ser Ser Tyr Phe Met Lys Arg Thr Gln Asp 515 520 75 1328 DNA Danio rerio 75 ggaccgtaga cctgctagct cttagttttg cttttattta tttctagtta actttgtttt 60 agaagttgtc tttattttta acctttaact tatcctcgtt ttgttttacc aaacactcca 120 agctttacaa tgtcgtccac tcgctctccc ctgaaaacca agaatgaaaa cacaatctct 180 actaaaatga ataacatgtc ctttgtggac aaagaaaaca cgccacccag tttgagctcc 240 accagaatcc tggcgtccaa aacggcgcgc aaaatctttg atgaatcaga ggggcagtcg 300 aaagcaaaga aaggagcagt ggaggaggag cctcttctga aggagaaccc ccatcgcttt 360 gtcattttcc caattcagta tcatgacatc tggcagatgt acaagaaggc agaggcctct 420 ttctggacag ctgaggaggt tgatctgtcc aaagatctgc aacactggga ttccctgaaa 480 gacgaggaga gatatttcat ctcccatgtt ctggctttct ttgctgcaag tgatggcatt 540 gtcaatgaga acttggtgga gcgtttcact caggaggtcc aggtgactga agcccgctgc 600 ttctatggct tccagatagc aatggaaaac atccactcag aaatgtacag cctgttgatt 660 gacacctaca ttaaagattc taaagagagg gaatttctgt tcaatgccat tgagacaatg 720 ccatgtgtaa agaagaaggc tgactgggca ctcaactgga ttggtgataa aaatgcacga 780 tatggtgaga gagtggtggc ctttgctgca gtggagggaa tcttcttctc tgggtctttt 840 gcttctattt tctggctaaa gaagagggga ctcatgcctg gactcacctt ctccaatgag 900 ctgatcagca gagatgaggg tcttcattgc gacttcgcct gcctcatgtt taaacacttg 960 atcaacaaac catcagaaga aaccgtgaag aaaatcatca tgaatgcagt tgaaattgag 1020 caggagttcc tgactgatgc tcttcccgtg aagctcattg gcatgaactg tgacctgatg 1080 aagcagtaca ttgagtttgt ggctgaccga ctgttgctgg agctgggatt tgacaaggtc 1140 tatagagtgg agaatccttt tgacttcatg gagaacattt cgttggaggg caagaccaac 1200 ttcttcgaga agcgagtggg cgagtaccag cgcatgggag tcatgtcggg aactacagat 1260 aacactttca ctctggatgc tgatttttaa agactcaatc tgagtgcttc aacggacact 1320 taacacaa 1328 76 386 PRT Danio rerio 76 Met Ser Ser Thr Arg Ser Pro Leu Lys Thr Lys Asn Glu Asn Thr Ile 1 5 10 15 Ser Thr Lys Met Asn Asn Met Ser Phe Val Asp Lys Glu Asn Thr Pro 20 25 30 Pro Ser Leu Ser Ser Thr Arg Ile Leu Ala Ser Lys Thr Ala Arg Lys 35 40 45 Ile Phe Asp Glu Ser Glu Gly Gln Ser Lys Ala Lys Lys Gly Ala Val 50 55 60 Glu Glu Glu Pro Leu Leu Lys Glu Asn Pro His Arg Phe Val Ile Phe 65 70 75 80 Pro Ile Gln Tyr His Asp Ile Trp Gln Met Tyr Lys Lys Ala Glu Ala 85 90 95 Ser Phe Trp Thr Ala Glu Glu Val Asp Leu Ser Lys Asp Leu Gln His 100 105 110 Trp Asp Ser Leu Lys Asp Glu Glu Arg Tyr Phe Ile Ser His Val Leu 115 120 125 Ala Phe Phe Ala Ala Ser Asp Gly Ile Val Asn Glu Asn Leu Val Glu 130 135 140 Arg Phe Thr Gln Glu Val Gln Val Thr Glu Ala Arg Cys Phe Tyr Gly 145 150 155 160 Phe Gln Ile Ala Met Glu Asn Ile His Ser Glu Met Tyr Ser Leu Leu 165 170 175 Ile Asp Thr Tyr Ile Lys Asp Ser Lys Glu Arg Glu Phe Leu Phe Asn 180 185 190 Ala Ile Glu Thr Met Pro Cys Val Lys Lys Lys Ala Asp Trp Ala Leu 195 200 205 Asn Trp Ile Gly Asp Lys Asn Ala Arg Tyr Gly Glu Arg Val Val Ala 210 215 220 Phe Ala Ala Val Glu Gly Ile Phe Phe Ser Gly Ser Phe Ala Ser Ile 225 230 235 240 Phe Trp Leu Lys Lys Arg Gly Leu Met Pro Gly Leu Thr Phe Ser Asn 245 250 255 Glu Leu Ile Ser Arg Asp Glu Gly Leu His Cys Asp Phe Ala Cys Leu 260 265 270 Met Phe Lys His Leu Ile Asn Lys Pro Ser Glu Glu Thr Val Lys Lys 275 280 285 Ile Ile Met Asn Ala Val Glu Ile Glu Gln Glu Phe Leu Thr Asp Ala 290 295 300 Leu Pro Val Lys Leu Ile Gly Met Asn Cys Asp Leu Met Lys Gln Tyr 305 310 315 320 Ile Glu Phe Val Ala Asp Arg Leu Leu Leu Glu Leu Gly Phe Asp Lys 325 330 335 Val Tyr Arg Val Glu Asn Pro Phe Asp Phe Met Glu Asn Ile Ser Leu 340 345 350 Glu Gly Lys Thr Asn Phe Phe Glu Lys Arg Val Gly Glu Tyr Gln Arg 355 360 365 Met Gly Val Met Ser Gly Thr Thr Asp Asn Thr Phe Thr Leu Asp Ala 370 375 380 Asp Phe 385 77 1956 DNA Danio rerio 77 gcgcccagcc tacccataac ccaccgcgcg cgcctccatt cttctctcca gtccaacatg 60 tccatgatag acagtccgtt aagcgtgctc ggccagagga cgactggcga gtccgtgagg 120 tctcaaaacg tcatggctgc tgcatccatt gctaacatcg tgaagagctc tctcggtccc 180 gtcggattgg acaagatgtt ggtggatgac atcggagacg tgaccatcac caatgacggg 240 gccaccatcc tgaagctgct ggaggtcgaa cacccggctg ctaaagtcct ctgtgagctg 300 gcagagctgc aggacaagga ggttggagac ggcactacat cagtggtgat cattgcggct 360 gagctgctga agagtgcgga tgagctggtc aaacagaaga ttcaccccac ctccatcatc 420 agcggataca gactggcctg caaggaggct gtccgctaca ttaatgagaa tctcaccatc 480 ggcacagatg acctgggcag agagtgtctc ctcaacgccg ccaagacctc catgtcctcc 540 aagatcatcg gagttgatgc tgagttcttc gctaatatgg tggtcgacgc cgcagtggct 600 gtgaagtttg tggacggtaa aggggttgct cggtatccca tcaactctgt caatgtgctg 660 aaggctcacg gacgcagcca gaaggagagc ttcctggtca acggatacgc actcaactgc 720 actgtcggct cacaagggat ggtaaaacgt gtggccaacg ctaagatagc ctgtctggac 780 ttcagcctgc agaaaaccaa gatgaagctg ggcgttcagg tggtcatcaa tgatccggag 840 aaactcgacc aaatcagaca gagggaatcg gacatcacaa aggaacgtgt tcagaagatc 900 ctggcatcag gagcgaatgt ggttttgacc acaggaggaa tcgacgacat gtgtctgaag 960 tatttcgtgg acgtgggggc catggcggtg agaagagtcc tgaagaaaga cctcaaacgc 1020 atcgctaaag ctactggagc cacggtttgc tcttctctgt ccaacctgga gggcgaggag 1080 acattcgagc cgtccatgtt gggtcaggca gaggaggtgg tgcaggagag agtgtgtgac 1140 gacgagctca tcctcatcaa gaacacaaag gcgcgcacct ctgcctccat catcctgcgc 1200 ggggccaatg atttcatgtg tgatgagatg gagcgctctc tgcatgacgc cctgtgtgtg 1260 gtgaagcgag tgctggagtc taaatctgtg gtgccaggag gaggcgctgt ggaggccgct 1320 ttgtccatct atctggaaaa ctacgcaacc agcatgggct cccgtgagca gctggctatt 1380 gcagagtttg ctcgctctct gctggtcatt cctaaaactc tggccgtgaa cgcagcgcag 1440 gactccactg acctggtggc caaactcaga gcctttcaca atgaggctca ggtcaacccc 1500 gatcgcaaga acctcaaatg gattggtttg gatctggtaa atggaaagcc aagggataac 1560 aagcaggcag gagtgtacga acccacgatg gtcaaaacca agagtctgaa gttcgccacc 1620 gaggcggcca ttaccatcct tcgaatcgat gacctcatca agctcttccc tgaccagaaa 1680 gaaggcgggc cctcctatca ggacgccgtc caatcaggct ctctagaggg ctaaggtctc 1740 tcattggctg attctggtgt cactccgctc agctttgtat ttatatgtgg atccaactga 1800 tgagctgtta gtagagattg tgcacttttt tgtttgcgtt acttatattt ttgctattag 1860 catggctgtc tgtcatgttt aatgtaaaaa aagttccttt ttctgccatt tcaaagaaaa 1920 taaaccagat tggtggtgtg gaaaagagaa aaaaaa 1956 78 556 PRT Danio rerio 78 Met Ile Asp Ser Pro Leu Ser Val Leu Gly Gln Arg Thr Thr Gly Glu 1 5 10 15 Ser Val Arg Ser Gln Asn Val Met Ala Ala Ala Ser Ile Ala Asn Ile 20 25 30 Val Lys Ser Ser Leu Gly Pro Val Gly Leu Asp Lys Met Leu Val Asp 35 40 45 Asp Ile Gly Asp Val Thr Ile Thr Asn Asp Gly Ala Thr Ile Leu Lys 50 55 60 Leu Leu Glu Val Glu His Pro Ala Ala Lys Val Leu Cys Glu Leu Ala 65 70 75 80 Glu Leu Gln Asp Lys Glu Val Gly Asp Gly Thr Thr Ser Val Val Ile 85 90 95 Ile Ala Ala Glu Leu Leu Lys Ser Ala Asp Glu Leu Val Lys Gln Lys 100 105 110 Ile His Pro Thr Ser Ile Ile Ser Gly Tyr Arg Leu Ala Cys Lys Glu 115 120 125 Ala Val Arg Tyr Ile Asn Glu Asn Leu Thr Ile Gly Thr Asp Asp Leu 130 135 140 Gly Arg Glu Cys Leu Leu Asn Ala Ala Lys Thr Ser Met Ser Ser Lys 145 150 155 160 Ile Ile Gly Val Asp Ala Glu Phe Phe Ala Asn Met Val Val Asp Ala 165 170 175 Ala Val Ala Val Lys Phe Val Asp Gly Lys Gly Val Ala Arg Tyr Pro 180 185 190 Ile Asn Ser Val Asn Val Leu Lys Ala His Gly Arg Ser Gln Lys Glu 195 200 205 Ser Phe Leu Val Asn Gly Tyr Ala Leu Asn Cys Thr Val Gly Ser Gln 210 215 220 Gly Met Val Lys Arg Val Ala Asn Ala Lys Ile Ala Cys Leu Asp Phe 225 230 235 240 Ser Leu Gln Lys Thr Lys Met Lys Leu Gly Val Gln Val Val Ile Asn 245 250 255 Asp Pro Glu Lys Leu Asp Gln Ile Arg Gln Arg Glu Ser Asp Ile Thr 260 265 270 Lys Glu Arg Val Gln Lys Ile Leu Ala Ser Gly Ala Asn Val Val Leu 275 280 285 Thr Thr Gly Gly Ile Asp Asp Met Cys Leu Lys Tyr Phe Val Asp Val 290 295 300 Gly Ala Met Ala Val Arg Arg Val Leu Lys Lys Asp Leu Lys Arg Ile 305 310 315 320 Ala Lys Ala Thr Gly Ala Thr Val Cys Ser Ser Leu Ser Asn Leu Glu 325 330 335 Gly Glu Glu Thr Phe Glu Pro Ser Met Leu Gly Gln Ala Glu Glu Val 340 345 350 Val Gln Glu Arg Val Cys Asp Asp Glu Leu Ile Leu Ile Lys Asn Thr 355 360 365 Lys Ala Arg Thr Ser Ala Ser Ile Ile Leu Arg Gly Ala Asn Asp Phe 370 375 380 Met Cys Asp Glu Met Glu Arg Ser Leu His Asp Ala Leu Cys Val Val 385 390 395 400 Lys Arg Val Leu Glu Ser Lys Ser Val Val Pro Gly Gly Gly Ala Val 405 410 415 Glu Ala Ala Leu Ser Ile Tyr Leu Glu Asn Tyr Ala Thr Ser Met Gly 420 425 430 Ser Arg Glu Gln Leu Ala Ile Ala Glu Phe Ala Arg Ser Leu Leu Val 435 440 445 Ile Pro Lys Thr Leu Ala Val Asn Ala Ala Gln Asp Ser Thr Asp Leu 450 455 460 Val Ala Lys Leu Arg Ala Phe His Asn Glu Ala Gln Val Asn Pro Asp 465 470 475 480 Arg Lys Asn Leu Lys Trp Ile Gly Leu Asp Leu Val Asn Gly Lys Pro 485 490 495 Arg Asp Asn Lys Gln Ala Gly Val Tyr Glu Pro Thr Met Val Lys Thr 500 505 510 Lys Ser Leu Lys Phe Ala Thr Glu Ala Ala Ile Thr Ile Leu Arg Ile 515 520 525 Asp Asp Leu Ile Lys Leu Phe Pro Asp Gln Lys Glu Gly Gly Pro Ser 530 535 540 Tyr Gln Asp Ala Val Gln Ser Gly Ser Leu Glu Gly 545 550 555 79 2232 DNA Danio rerio 79 gatgaacatc tggagaacca cagacaagaa gaacatctgg agaaccacag acaagacaac 60 atagtaactg aagacaacca tctggagaac cacagacaag acaacataga cactgaagat 120 gaacatctgg agaaccacag agaagacaac atagacactg aagatgaaca tctggagaac 180 cacagacgag aagaacatct ggagagcttc aggtcacgcc cactgattgt gtaatcacca 240 ccgcccaatg gcagctcaaa gtgacgttcg cgccgtcagc ttcgctgctg gccaatcggt 300 gccgcctcgc cgctaagccc cgcccctgtc ctgcagtctg agcgcgcccg cgggagttat 360 gctgccgagg ctcacattaa atctgcaaga atgagcgaca aaccctgcga accgagctgg 420 gagcagatca tcaaccgctg gagcttcgat ttctacgcgt tccaggcctt cagcgcgttc 480 aggaatggag attacacggc cttcacacac ttcatcaaca tcatcgagag tgtggtggtc 540 aggccggtgg acggacacac tgacatgatc ccaaagctgc gcctcatgca gtttctgtcc 600 aggatcaaca acggggacaa gctggatctg accttcgagg agcccaagac ccctctggag 660 tcggctctgg acgtgctgga gagcatctgc agagacatga gcgtcccgca cacagagcag 720 cagctcatac gccacgccat cagcgagatg ctggtgatgg tgtgtatcag gagcggtgag 780 ttcgagaagg ctgaggagat gctcaaacac ttcagctcct cgtcagactc tgctgggaag 840 aagaagctgc tgctgcacct gatccgctgc cggcgctcct ctcacacagt gctgcagctc 900 agctcctaca gccacttcaa gcaggacatg ctgcagttcg tggagcggct cttcagcgtc 960 cccgagccct tcctgctgca gatgctgagg cgctctggat ggcccagcca gaggccggag 1020 gtcaggagat ccactcaaaa caaccggaga ttcacagtga agcccagcaa acgcacacct 1080 ccacacagca gcgcagcaga ggatcatggg gcggagtgtg tgtctgcagt gagtcggcag 1140 ggtctgcagc aggtgtttga tctgctgagt gagcagtacg gcgtgggcat ctccttcttc 1200 cagctgcagg acagtgtgga ggcggaggcg cagcagcagc aggagggagg agtggcgggc 1260 cccgagctgt gcctgacgct gagcgagacc cccatgcaga ctgtgatcct cagcgacacg 1320 gaggagcagg acgagccgcg caggtacgcc ggcatgacca tctcacgcct cgtgcaggag 1380 gaggacagcc agatcagcgc tgaacacaca cacatagagg acgaggagga ggaggaggag 1440 cagacacaca cacaccgcgg aggagactgc agtttatcag agccgcagtg ttcctccact 1500 cagcgctcga ctcctgcacg tctctgcaaa cgctcaaaca cacgtgtgca gcaaagtgaa 1560 tcagagtctc agagttcacc agctcatcgc tcgactcctg cacgtgcttc acatcccagc 1620 aaccctcgct ccagcacacg caagaaaaca ccgagccggc gcaggattat tgagtccagt 1680 gactctgaga atgagcagac aactcctgca gccgcaacac acacacctag aggacagcgg 1740 cagcagcaca ggtctatccg tgtgtccagc gggtctgaac cggaggtgga cagcacatct 1800 cctgcagcac acacacgccg ctccacaccc accagcacac acacgcgcac caccaagagg 1860 tctaaatggc tggatgtatc ggggattcag gataactgga gtgatgagga ttctctgttt 1920 cacacttcta cagctccagc taaaaagtac accaggaaga tgtggtccgt gcaggagtcg 1980 gagtggctga agcagggtgt ggtccgctat ggtgtgggac actgggagag gatccgctcc 2040 gccttcccct tcgccggacg caccgctgta aacctgaagg accgctggag gaccatggtc 2100 aagctcaaga tggtctgatg tgtgtgtttg cgagcgtgcg tgtgtgtgtg tgtatgtttc 2160 agcacaatca tgaagtttta gactgatgtt ttataacagt tctgttcata ttaaatgttt 2220 ctgatgtttg gc 2232 80 575 PRT Danio rerio 80 Met Ser Asp Lys Pro Cys Glu Pro Ser Trp Glu Gln Ile Ile Asn Arg 1 5 10 15 Trp Ser Phe Asp Phe Tyr Ala Phe Gln Ala Phe Ser Ala Phe Arg Asn 20 25 30 Gly Asp Tyr Thr Ala Phe Thr His Phe Ile Asn Ile Ile Glu Ser Val 35 40 45 Val Val Arg Pro Val Asp Gly His Thr Asp Met Ile Pro Lys Leu Arg 50 55 60 Leu Met Gln Phe Leu Ser Arg Ile Asn Asn Gly Asp Lys Leu Asp Leu 65 70 75 80 Thr Phe Glu Glu Pro Lys Thr Pro Leu Glu Ser Ala Leu Asp Val Leu 85 90 95 Glu Ser Ile Cys Arg Asp Met Ser Val Pro His Thr Glu Gln Gln Leu 100 105 110 Ile Arg His Ala Ile Ser Glu Met Leu Val Met Val Cys Ile Arg Ser 115 120 125 Gly Glu Phe Glu Lys Ala Glu Glu Met Leu Lys His Phe Ser Ser Ser 130 135 140 Ser Asp Ser Ala Gly Lys Lys Lys Leu Leu Leu His Leu Ile Arg Cys 145 150 155 160 Arg Arg Ser Ser His Thr Val Leu Gln Leu Ser Ser Tyr Ser His Phe 165 170 175 Lys Gln Asp Met Leu Gln Phe Val Glu Arg Leu Phe Ser Val Pro Glu 180 185 190 Pro Phe Leu Leu Gln Met Leu Arg Arg Ser Gly Trp Pro Ser Gln Arg 195 200 205 Pro Glu Val Arg Arg Ser Thr Gln Asn Asn Arg Arg Phe Thr Val Lys 210 215 220 Pro Ser Lys Arg Thr Pro Pro His Ser Ser Ala Ala Glu Asp His Gly 225 230 235 240 Ala Glu Cys Val Ser Ala Val Ser Arg Gln Gly Leu Gln Gln Val Phe 245 250 255 Asp Leu Leu Ser Glu Gln Tyr Gly Val Gly Ile Ser Phe Phe Gln Leu 260 265 270 Gln Asp Ser Val Glu Ala Glu Ala Gln Gln Gln Gln Glu Gly Gly Val 275 280 285 Ala Gly Pro Glu Leu Cys Leu Thr Leu Ser Glu Thr Pro Met Gln Thr 290 295 300 Val Ile Leu Ser Asp Thr Glu Glu Gln Asp Glu Pro Arg Arg Tyr Ala 305 310 315 320 Gly Met Thr Ile Ser Arg Leu Val Gln Glu Glu Asp Ser Gln Ile Ser 325 330 335 Ala Glu His Thr His Ile Glu Asp Glu Glu Glu Glu Glu Glu Gln Thr 340 345 350 His Thr His Arg Gly Gly Asp Cys Ser Leu Ser Glu Pro Gln Cys Ser 355 360 365 Ser Thr Gln Arg Ser Thr Pro Ala Arg Leu Cys Lys Arg Ser Asn Thr 370 375 380 Arg Val Gln Gln Ser Glu Ser Glu Ser Gln Ser Ser Pro Ala His Arg 385 390 395 400 Ser Thr Pro Ala Arg Ala Ser His Pro Ser Asn Pro Arg Ser Ser Thr 405 410 415 Arg Lys Lys Thr Pro Ser Arg Arg Arg Ile Ile Glu Ser Ser Asp Ser 420 425 430 Glu Asn Glu Gln Thr Thr Pro Ala Ala Ala Thr His Thr Pro Arg Gly 435 440 445 Gln Arg Gln Gln His Arg Ser Ile Arg Val Ser Ser Gly Ser Glu Pro 450 455 460 Glu Val Asp Ser Thr Ser Pro Ala Ala His Thr Arg Arg Ser Thr Pro 465 470 475 480 Thr Ser Thr His Thr Arg Thr Thr Lys Arg Ser Lys Trp Leu Asp Val 485 490 495 Ser Gly Ile Gln Asp Asn Trp Ser Asp Glu Asp Ser Leu Phe His Thr 500 505 510 Ser Thr Ala Pro Ala Lys Lys Tyr Thr Arg Lys Met Trp Ser Val Gln 515 520 525 Glu Ser Glu Trp Leu Lys Gln Gly Val Val Arg Tyr Gly Val Gly His 530 535 540 Trp Glu Arg Ile Arg Ser Ala Phe Pro Phe Ala Gly Arg Thr Ala Val 545 550 555 560 Asn Leu Lys Asp Arg Trp Arg Thr Met Val Lys Leu Lys Met Val 565 570 575 81 4135 DNA Danio rerio 81 ttttattttc gagttgttgc gtttgttgaa tgccctcgtc ttgtgtttca ttgtttattg 60 acaacaaccg gtatttctcc gttttctatc acgccaaacg ggttaaatta ttaacccaga 120 aatcattagt cgcgtcttgc cgcggttaaa accgctcgag agcctgggag gacagtttta 180 acatatttgt ggttcatatt cacgctcaca cccgtacgac tttcgattaa aacacgacga 240 aggcatgtgc aaaccgtcca aacaacctgc aggatgaacc gtgtacaagt ggattttaaa 300 gggttgccag cccacatctt ggagaacagc attgcagcag aaagccttca aaacactagg 360 tcatcagaca atgttctcac tccattgaca tttccaaaat ctaaagtggc tctgtgggat 420 ccatctgcca atggtgaagt tgtgagtctg catttctcgt attacagaaa tccgagactc 480 ttcttggtag aaaaggcatt acgtttggct caccgtcatg ctagacagac caacaagcct 540 cgatttttct gttttctgct tgggacactt gcagtagaca gtgatgagga aggagtgact 600 ataaccctgg atcgttttga tccaggtaga gagcagacgg gatgtcttgg gaaagcacca 660 actgctctgc tcccaggaga catccttgta ccatgtgtat tcgaagccca gcatgcagct 720 agcagcactg tacactcgag tgaggatttg aacatctcgt ttaagatgct tcaacatttc 780 tgctgcagta aagagttgct ggagttgtcc aaactcctca ccttgcgtgc tcaactcagc 840 tgttctgaga acatggacag gctcaccttt aacttatcct gggctgctgt gactttggcc 900 tgcacgctag atgctgtccc tattagggct gtgcccatca tccctaccgc actggccagg 960 aacctcagta gccctgcagg tgtgacccaa aacagtaaac gtgggtttct aaccatggat 1020 cagaccagga agcttctcct catacttgaa tctgatccca aggcttacac tctccctcta 1080 gttggaatat ggttgagtgg ggttacacac attcataatc ccttggtgtg ggcgtggtgt 1140 ttgaggtacc tgcacagctc ctccctcctg gacaaggttt tgtcagaggg aggcaccttc 1200 ctggtggtcc tgtattccct gactcatcga gatccggagt tttaccagtg taagccaagc 1260 actggacagc agcaactgag ctttcagctg ctcacaagca gagagtcact cacactttat 1320 aagaatgttg agccttcaga aagacgacct ttacaatttg agctcagctc tgaaaatcaa 1380 aatcaggaaa ctgtgttgtt tgaggaagtg ctctcacagt ctgtgttaac agggacgact 1440 cttggagcag catctgctgc tcctcagaat aagctgtcta tcagtgatca tgactcaggt 1500 gtagaagatg aagacctctc tcctcgacca tctccaaacc ctcatccctg ttagtcagca 1560 gactaaaggt gttcatcctt tagtaccaga gctctccatg gttttagatg gcagcttttt 1620 ggatgggagt gttgttaata ctcaaggctc aactcccctt tctcatagtc aatcaaatgt 1680 tcataggaga aacagtagtc ctgccctcca gggtctctct gttctgagac ctctagtaca 1740 gggaagcgtt actggacccc ctccaatccg aagaccacta actcccattc tttctcagcc 1800 gaagaacaag ttacatccta atccatcaca acaaacacca caacacagtg tgagtcgaaa 1860 atccctgccc tcaatgagaa gatcaagaga aggctcttca gcatcatcag tatcatcatc 1920 ttcatcctct tcctcaacaa aaaacgcttc acccaatggc tcttttcatc aacaaaggca 1980 gcgtttatct cagggtttcc caaacaaacc ccagctgatt tattctgggg cctccaacat 2040 caggtcatag cagtgccaaa aagagttcat cagtacctag ccagacccct gtccctcatc 2100 cttcccaaca tagaatcttc catagcaccc cagctgtaaa tccttgcaac tgtgcaccaa 2160 ccaccccagc gttgctcctg tccaggattt cttcaacatc caacatctcc cacaaacaca 2220 acacgtggtt tggttttgga gtccaaatat ttcagagtcc ggtgttggga gagagtgcaa 2280 gcatgtatta tcaatctcag tcacaaagca aagatctgtc tgaaaataga gaaattgacg 2340 acccaaggtt ttaccacgag ctgttgggtc aggtgcaaag ccgtctgcaa gactctgtga 2400 ttgtggagga taaggtagag caggaccaac agagtctcct gaaaagacaa agtctgtccc 2460 ctgttggcca gcagtcaagg aaaccactga caacttcttc cataccccag actcaaaaaa 2520 caaagcagcc atctagtcca cctaatcagg atcgtgtctt aagtgcaaca ttaaaacagc 2580 tacaacagtt tggggtgaac atcgacttag actcttctca ggaaaaaacg acacgtgcca 2640 ctgtggagag tgccagtacc cttgcctgca taaaccctga agcagtgatt ccaagactag 2700 ctctttctga accagtgggt gccagcattt gggggccaag tggcagcgta gaccttagtc 2760 ttgaggccaa tgccattgca cttaagtacc taagcgactc acaactttca agactctcac 2820 tgggcagtca gtcctctagc ccacattcag accccagcac aattctcctg agaagaccgg 2880 ctgtggaaaa gagcaatgtt gcactcagta tcctgtcctc cagcaatatg tccctagcta 2940 cctgtaaata catgaagaaa tatggactga ttgaaggaga aatcagcagt gaggaagagc 3000 aggaggaccc catccaggta gattctgctc ttggatgttc agtacaacat gaaacatcta 3060 aaacaatcag ccttaggcaa gaacgtgaag aacaaaacac agccgttcta aaaaacataa 3120 caaacaagcc gattgtcaat ctccatacat ctcccattga ctctcaagag cagatactac 3180 aagacttgag gcctaaaatg cagctgcttt tgcgtggcgg gacgaactca gagaaggaga 3240 atgccacaaa gagagatctg attgaacgca ggtcctcgct tactgaaaac cggagaacac 3300 aggaagttgt ggaccctcag gggtcagtgg gaaatttcct agatctgagt aggttacggc 3360 agctccccaa gctcttttaa agtgcatcat gggtgttttc ctaatcgggc tgcttctgtg 3420 catcagaggg gtttgagaga ttaatgcaat cctcaaatgt ttgtttacat ggcataccat 3480 gaatgaattt tatttttaag tgcgcttttc atttaagctt ttaagtatgt tttaatgtct 3540 ttaactttgt cttgttttgt ctaacccaaa gggatagttt gcccacaaat taaaatttac 3600 cctcttttta ttcctctttc aaaggtttta aaccctttta agtttcttta ttctgttgaa 3660 cacaaaagat gatatttgta agaaatctga aaacctgtca ttgacttcca tagtatttat 3720 ttttcttact atggtactaa atgtttacag gttttccttt ttcttcaaaa tcatctttcg 3780 tttgtttttc atattttgtg ctccactaaa gacaaaaaaa cctaaaaggt ttaaaacttt 3840 taattattgt gtgaactatc cctttaacag gtgctatttc cttttatgtg gaaatatttg 3900 tcttcacctg atgactcctt atgtgtcaat aactaagtgc cttgtatgaa gtgattcact 3960 taaggcagtg ttgtggtttt gtactatatt taaacacact tgtgttttgg cattacttga 4020 tgtttctgaa attggtgcaa acatttctac gtttttcaat gacttcatgt acatgtatct 4080 gtcttataat taaattacat attcttttac acaaaaaaaa aaaaaaaaaa aaaaa 4135 82 1044 PRT Danio rerio 82 Met Asn Arg Val Gln Val Asp Phe Lys Gly Leu Pro Ala His Ile Leu 1 5 10 15 Glu Asn Ser Ile Ala Ala Glu Ser Leu Gln Asn Thr Arg Ser Ser Asp 20 25 30 Asn Val Leu Thr Pro Leu Thr Phe Pro Lys Ser Lys Val Ala Leu Trp 35 40 45 Asp Pro Ser Ala Asn Gly Glu Val Val Ser Leu His Phe Ser Tyr Tyr 50 55 60 Arg Asn Pro Arg Leu Phe Leu Val Glu Lys Ala Leu Arg Leu Ala His 65 70 75 80 Arg His Ala Arg Gln Thr Asn Lys Pro Arg Phe Phe Cys Phe Leu Leu 85 90 95 Gly Thr Leu Ala Val Asp Ser Asp Glu Glu Gly Val Thr Ile Thr Leu 100 105 110 Asp Arg Phe Asp Pro Gly Arg Glu Gln Thr Gly Cys Leu Gly Lys Ala 115 120 125 Pro Thr Ala Leu Leu Pro Gly Asp Ile Leu Val Pro Cys Val Phe Glu 130 135 140 Ala Gln His Ala Ala Ser Ser Thr Val His Ser Ser Glu Asp Leu Asn 145 150 155 160 Ile Ser Phe Lys Met Leu Gln His Phe Cys Cys Ser Lys Glu Leu Leu 165 170 175 Glu Leu Ser Lys Leu Leu Thr Leu Arg Ala Gln Leu Ser Cys Ser Glu 180 185 190 Asn Met Asp Arg Leu Thr Phe Asn Leu Ser Trp Ala Ala Val Thr Leu 195 200 205 Ala Cys Thr Leu Asp Ala Val Pro Ile Arg Ala Val Pro Ile Ile Pro 210 215 220 Thr Ala Leu Ala Arg Asn Leu Ser Ser Pro Ala Gly Val Thr Gln Asn 225 230 235 240 Ser Lys Arg Gly Phe Leu Thr Met Asp Gln Thr Arg Lys Leu Leu Leu 245 250 255 Ile Leu Glu Ser Asp Pro Lys Ala Tyr Thr Leu Pro Leu Val Gly Ile 260 265 270 Trp Leu Ser Gly Val Thr His Ile His Asn Pro Leu Val Trp Ala Trp 275 280 285 Cys Leu Arg Tyr Leu His Ser Ser Ser Leu Leu Asp Lys Val Leu Ser 290 295 300 Glu Gly Gly Thr Phe Leu Val Val Leu Tyr Ser Leu Thr His Arg Asp 305 310 315 320 Pro Glu Phe Tyr Gln Cys Lys Pro Ser Thr Gly Gln Gln Gln Leu Ser 325 330 335 Phe Gln Leu Leu Thr Ser Arg Glu Ser Leu Thr Leu Tyr Lys Asn Val 340 345 350 Glu Pro Ser Glu Arg Arg Pro Leu Gln Phe Glu Leu Ser Ser Glu Asn 355 360 365 Gln Asn Gln Glu Thr Val Leu Phe Glu Glu Val Leu Ser Gln Ser Val 370 375 380 Leu Thr Gly Thr Thr Leu Gly Ala Ala Ser Ala Ala Pro Gln Asn Lys 385 390 395 400 Leu Ser Ile Ser Asp His Asp Ser Gly Val Glu Asp Glu Asp Leu Ser 405 410 415 Pro Arg Pro Ser Pro Asn Pro His Pro Cys Ser Gln Gln Thr Lys Gly 420 425 430 Val His Pro Leu Val Pro Glu Leu Ser Met Val Leu Asp Gly Ser Phe 435 440 445 Leu Asp Gly Ser Val Val Asn Thr Gln Gly Ser Thr Pro Leu Ser His 450 455 460 Ser Gln Ser Asn Val His Arg Arg Asn Ser Ser Pro Ala Leu Gln Gly 465 470 475 480 Leu Ser Val Leu Arg Pro Leu Val Gln Gly Ser Val Thr Gly Pro Pro 485 490 495 Pro Ile Arg Arg Pro Leu Thr Pro Ile Leu Ser Gln Pro Lys Asn Lys 500 505 510 Leu His Pro Asn Pro Ser Gln Gln Thr Pro Gln His Ser Val Ser Arg 515 520 525 Lys Ser Leu Pro Ser Met Arg Arg Ser Arg Glu Gly Ser Ser Ala Ser 530 535 540 Ser Val Ser Ser Ser Ser Ser Ser Ser Ser Thr Lys Asn Ala Ser Pro 545 550 555 560 Asn Gly Ser Phe His Gln Gln Arg Gln Arg Leu Ser Gln Gly Phe Pro 565 570 575 Asn Lys Pro Gln Leu Ile Tyr Ser Gly Ala Ser Asn Ile Arg Ser Phe 580 585 590 Ile Leu Gly Pro Pro Thr Ser Gly His Ser Ser Ala Lys Lys Ser Ser 595 600 605 Ser Val Pro Ser Gln Thr Pro Val Pro His Pro Ser Gln His Arg Ile 610 615 620 Phe His Ser Thr Pro Ala Val Asn Pro Cys Asn Cys Ala Pro Thr Thr 625 630 635 640 Pro Ala Leu Leu Leu Ser Arg Ile Ser Ser Thr Ser Asn Ile Ser His 645 650 655 Lys His Asn Thr Trp Phe Gly Phe Gly Val Gln Ile Phe Gln Ser Pro 660 665 670 Val Leu Gly Glu Ser Ala Ser Met Tyr Tyr Gln Ser Gln Ser Gln Ser 675 680 685 Lys Asp Leu Ser Glu Asn Arg Glu Ile Asp Asp Pro Arg Phe Tyr His 690 695 700 Glu Leu Leu Gly Gln Val Gln Ser Arg Leu Gln Asp Ser Val Ile Val 705 710 715 720 Glu Asp Lys Val Glu Gln Asp Gln Gln Ser Leu Leu Lys Arg Gln Ser 725 730 735 Leu Ser Pro Val Gly Gln Gln Ser Arg Lys Pro Leu Thr Thr Ser Ser 740 745 750 Ile Pro Gln Thr Gln Lys Thr Lys Gln Pro Ser Ser Pro Pro Asn Gln 755 760 765 Asp Arg Val Leu Ser Ala Thr Leu Lys Gln Leu Gln Gln Phe Gly Val 770 775 780 Asn Ile Asp Leu Asp Ser Ser Gln Glu Lys Thr Thr Arg Ala Thr Val 785 790 795 800 Glu Ser Ala Ser Thr Leu Ala Cys Ile Asn Pro Glu Ala Val Ile Pro 805 810 815 Arg Leu Ala Leu Ser Glu Pro Val Gly Ala Ser Ile Trp Gly Pro Ser 820 825 830 Gly Ser Val Asp Leu Ser Leu Glu Ala Asn Ala Ile Ala Leu Lys Tyr 835 840 845 Leu Ser Asp Ser Gln Leu Ser Arg Leu Ser Leu Gly Ser Gln Ser Ser 850 855 860 Ser Pro His Ser Asp Pro Ser Thr Ile Leu Leu Arg Arg Pro Ala Val 865 870 875 880 Glu Lys Ser Asn Val Ala Leu Ser Ile Leu Ser Ser Ser Asn Met Ser 885 890 895 Leu Ala Thr Cys Lys Tyr Met Lys Lys Tyr Gly Leu Ile Glu Gly Glu 900 905 910 Ile Ser Ser Glu Glu Glu Gln Glu Asp Pro Ile Gln Val Asp Ser Ala 915 920 925 Leu Gly Cys Ser Val Gln His Glu Thr Ser Lys Thr Ile Ser Leu Arg 930 935 940 Gln Glu Arg Glu Glu Gln Asn Thr Ala Val Leu Lys Asn Ile Thr Asn 945 950 955 960 Lys Pro Ile Val Asn Leu His Thr Ser Pro Ile Asp Ser Gln Glu Gln 965 970 975 Ile Leu Gln Asp Leu Arg Pro Lys Met Gln Leu Leu Leu Arg Gly Gly 980 985 990 Thr Asn Ser Glu Lys Glu Asn Ala Thr Lys Arg Asp Leu Ile Glu Arg 995 1000 1005 Arg Ser Ser Leu Thr Glu Asn Arg Arg Thr Gln Glu Val Val Asp Pro 1010 1015 1020 Gln Gly Ser Val Gly Asn Phe Leu Asp Leu Ser Arg Leu Arg Gln Leu 1025 1030 1035 1040 Pro Lys Leu Phe 83 773 DNA Danio rerio 83 ggggacatcc ggtggttttc gccccgcgct gttccgctgt gaagatgccg aagttttatt 60 gcgattactg tgacacatac ctgacacatg attcgccctc ggtcagaaag acacactgca 120 gtggacgcaa acacaaagaa aatgtgaagg attattatca gaagtggatg gaggagcaag 180 ctcagtctct cattgacaaa acgacggctg cattccagca gggtaaaatt ccgcccaccc 240 cgttccctgg agcccctcca cctggcggat ccctgctacc tcatccaagc attggcggac 300 ctccgagacc gggcatgtta ccagctccac cgatgggtgg tccacctatg atgcccatga 360 tgggcccccc tcctcatgcc atgatgcctg gaggaccagg tccaggtatg cggccaccaa 420 tgggtgggcc tatgcaaatg atgcctgggc cacacatgat gcgacctcct gctcggccaa 480 tgatgccagc cgttagacct ggtatggtgc gtcctgatcg ataaaactac agtgccatga 540 cttccaaccc gatacccacg gaagccaaat gtctaaactg gttttacagt cgatttgaag 600 agttttattg cccatttgcg ttttatgatg actgatgtct gcggttcagg aaacctcgtt 660 ccgcttgagt gaccgggatt atcacaaatt cttgtaaaat ttaatgaacc ttccattggc 720 tttcatttca tatttatgac agaccagagg gtgttgcgta atgggtttgt gta 773 84 159 PRT Danio rerio 84 Met Pro Lys Phe Tyr Cys Asp Tyr Cys Asp Thr Tyr Leu Thr His Asp 1 5 10 15 Ser Pro Ser Val Arg Lys Thr His Cys Ser Gly Arg Lys His Lys Glu 20 25 30 Asn Val Lys Asp Tyr Tyr Gln Lys Trp Met Glu Glu Gln Ala Gln Ser 35 40 45 Leu Ile Asp Lys Thr Thr Ala Ala Phe Gln Gln Gly Lys Ile Pro Pro 50 55 60 Thr Pro Phe Pro Gly Ala Pro Pro Pro Gly Gly Ser Leu Leu Pro His 65 70 75 80 Pro Ser Ile Gly Gly Pro Pro Arg Pro Gly Met Leu Pro Ala Pro Pro 85 90 95 Met Gly Gly Pro Pro Met Met Pro Met Met Gly Pro Pro Pro His Ala 100 105 110 Met Met Pro Gly Gly Pro Gly Pro Gly Met Arg Pro Pro Met Gly Gly 115 120 125 Pro Met Gln Met Met Pro Gly Pro His Met Met Arg Pro Pro Ala Arg 130 135 140 Pro Met Met Pro Ala Val Arg Pro Gly Met Val Arg Pro Asp Arg 145 150 155 85 1696 DNA Danio rerio 85 gcggggacag aatccaagat gtcgcttaca agttttcttc ctgcgcccac ccaactgtcc 60 caggaccagc tggaggcaga ggagaggtct caggcataca gatcccagtc cactgctttg 120 gtctccaccc gcagagaccc tcctccttac ggcttcagaa agtcatgggt accccgggct 180 ctagaggact ttggagatgg aggagctttt ccagagatcc atgttgccca gtatcctctg 240 gaaatgggta ggaagaaaag gacgtccaat gctctggcag tgcaggtgga tgcagagggc 300 aagatcaaat atgatgctat tgctagacaa ggccaaagca aagacaaggt catatttagt 360 aaatacacag acatgcttcc taaggaagtg ctaaaccaag atgatccaga gctgcagaaa 420 ccagatgaag aggcagtgag agagcttaca gaaaagacca gattagctct ggagaaacag 480 gtttcacaga aaatcgcggc agccatgcct gtgcgtgctg cagataaaca ggcccccgcg 540 caatacattc ggtacacacc ctctcagcaa ggtctcgcat ttaactctgg agcaaaacag 600 agggtcatcc gtatggtgga aatgcagaaa gatccaatgg aaccccctcg attcaaaatc 660 aataagaaaa ttcctcgtgg acctccttcc cctcctgctc cagtcatgca ctctccaagc 720 agaaagatga cagtgaaaga acagcaggaa tggaagattc ctccttgtat ttccaactgg 780 aagaacgcaa agggttacac cattcctcta gacaagcgtt tggcggctga cggacgaggc 840 ctccagaccg tccacatcaa tgaaaacttt gccaaactgg cagaggcttt gtacatagca 900 gacagaaagg ccagagaggc tgtggagatg agagctcaag tggaaaagaa gatggcccag 960 aaggaaaaag aaaagaaaga ggagaagctg agagagctgg ctaagatggc cagagacagg 1020 agagctggta tcaaacccca cggtgataaa ggtggtgaag actccgaagt cagggagcgt 1080 gatgagattc gtcacgatag gaggaaagat aggcagcatg acaggaacat ctccagagcc 1140 gcccccgata agaggtcgaa gcttcagagg gaccaagaca gggacatcag tgagctcatt 1200 gctctcggcc agccgaaccc gcgcacctct agtgaggctc agtatgacca gagactcttc 1260 aatcagagca agggaatgga cagcggtttt gctgggggag aggatgagat gtataatgtg 1320 tacgatcagc ccttcagagg tggcagagac atggcacaga atatttacag gcccagtaag 1380 aatgtggata aagacatgta tggagatgac ctggacacac tcatgcagaa caacaggttt 1440 gttcccgacc gagacttctc aggtgctgac catggccccc gccgggatgg tcctgtacag 1500 tttgaggagg atcctttcgg tctggacaag ttcttggagg aagccaagca gcacggaggc 1560 tccaaaagag cgtccactag tgggcgctcc aaggactatg accatgagaa gaaacgcagg 1620 aaggagtgag acactttctc tgtcaaactt caggcttaaa tccttcttcc catacttttt 1680 tttttgtaac tgaata 1696 86 536 PRT Danio rerio 86 Met Ser Leu Thr Ser Phe Leu Pro Ala Pro Thr Gln Leu Ser Gln Asp 1 5 10 15 Gln Leu Glu Ala Glu Glu Arg Ser Gln Ala Tyr Arg Ser Gln Ser Thr 20 25 30 Ala Leu Val Ser Thr Arg Arg Asp Pro Pro Pro Tyr Gly Phe Arg Lys 35 40 45 Ser Trp Val Pro Arg Ala Leu Glu Asp Phe Gly Asp Gly Gly Ala Phe 50 55 60 Pro Glu Ile His Val Ala Gln Tyr Pro Leu Glu Met Gly Arg Lys Lys 65 70 75 80 Arg Thr Ser Asn Ala Leu Ala Val Gln Val Asp Ala Glu Gly Lys Ile 85 90 95 Lys Tyr Asp Ala Ile Ala Arg Gln Gly Gln Ser Lys Asp Lys Val Ile 100 105 110 Phe Ser Lys Tyr Thr Asp Met Leu Pro Lys Glu Val Leu Asn Gln Asp 115 120 125 Asp Pro Glu Leu Gln Lys Pro Asp Glu Glu Ala Val Arg Glu Leu Thr 130 135 140 Glu Lys Thr Arg Leu Ala Leu Glu Lys Gln Val Ser Gln Lys Ile Ala 145 150 155 160 Ala Ala Met Pro Val Arg Ala Ala Asp Lys Gln Ala Pro Ala Gln Tyr 165 170 175 Ile Arg Tyr Thr Pro Ser Gln Gln Gly Leu Ala Phe Asn Ser Gly Ala 180 185 190 Lys Gln Arg Val Ile Arg Met Val Glu Met Gln Lys Asp Pro Met Glu 195 200 205 Pro Pro Arg Phe Lys Ile Asn Lys Lys Ile Pro Arg Gly Pro Pro Ser 210 215 220 Pro Pro Ala Pro Val Met His Ser Pro Ser Arg Lys Met Thr Val Lys 225 230 235 240 Glu Gln Gln Glu Trp Lys Ile Pro Pro Cys Ile Ser Asn Trp Lys Asn 245 250 255 Ala Lys Gly Tyr Thr Ile Pro Leu Asp Lys Arg Leu Ala Ala Asp Gly 260 265 270 Arg Gly Leu Gln Thr Val His Ile Asn Glu Asn Phe Ala Lys Leu Ala 275 280 285 Glu Ala Leu Tyr Ile Ala Asp Arg Lys Ala Arg Glu Ala Val Glu Met 290 295 300 Arg Ala Gln Val Glu Lys Lys Met Ala Gln Lys Glu Lys Glu Lys Lys 305 310 315 320 Glu Glu Lys Leu Arg Glu Leu Ala Lys Met Ala Arg Asp Arg Arg Ala 325 330 335 Gly Ile Lys Pro His Gly Asp Lys Gly Gly Glu Asp Ser Glu Val Arg 340 345 350 Glu Arg Asp Glu Ile Arg His Asp Arg Arg Lys Asp Arg Gln His Asp 355 360 365 Arg Asn Ile Ser Arg Ala Ala Pro Asp Lys Arg Ser Lys Leu Gln Arg 370 375 380 Asp Gln Asp Arg Asp Ile Ser Glu Leu Ile Ala Leu Gly Gln Pro Asn 385 390 395 400 Pro Arg Thr Ser Ser Glu Ala Gln Tyr Asp Gln Arg Leu Phe Asn Gln 405 410 415 Ser Lys Gly Met Asp Ser Gly Phe Ala Gly Gly Glu Asp Glu Met Tyr 420 425 430 Asn Val Tyr Asp Gln Pro Phe Arg Gly Gly Arg Asp Met Ala Gln Asn 435 440 445 Ile Tyr Arg Pro Ser Lys Asn Val Asp Lys Asp Met Tyr Gly Asp Asp 450 455 460 Leu Asp Thr Leu Met Gln Asn Asn Arg Phe Val Pro Asp Arg Asp Phe 465 470 475 480 Ser Gly Ala Asp His Gly Pro Arg Arg Asp Gly Pro Val Gln Phe Glu 485 490 495 Glu Asp Pro Phe Gly Leu Asp Lys Phe Leu Glu Glu Ala Lys Gln His 500 505 510 Gly Gly Ser Lys Arg Ala Ser Thr Ser Gly Arg Ser Lys Asp Tyr Asp 515 520 525 His Glu Lys Lys Arg Arg Lys Glu 530 535 87 2301 DNA Danio rerio 87 ttctggcaac atggcgacct gcttatgagc cacgctggtg ctgaagttat cgaagccgtc 60 aaaataaaca ttcttttctg acacttagta ttagttaaat agcaaaaatg tcgggtcgtc 120 acaataacaa attaccgacg aacctcccgc agttgcaaaa tctgatcaag agggatccga 180 agtcctacac cgaggagttt ttgcagcagt atcgtcatta tcagtcaaat gtggagatct 240 ttaaacacca gccggataaa gccaacaaag acctgtccga gctggtcatg tttctggctc 300 aggttggaca ctgttattta gaggagctgt cagatttccc ccagcagctg acggatctgc 360 ttttaaacta tcatacagtg ctggagtctg atctcagaat gacgttttgc aaagctctga 420 tcatgttgag gaacaaagat ctggtcagtc ccaccagtct cctggggctt ttcttcgagc 480 tgctgcgctg tcacgacaaa cttctgagaa agactttata tacccatatt gtgacggata 540 tcaaaaacat caacgcaaag cacaaaaaca acaaaatgaa aacgactttg cagaacttca 600 tgtacaccat gctgagggat tcaaacccca ttgctgccaa aatctctcta gatgtgatgg 660 ttgagcttta cagaagaaac atctggaatg atgctaaaac tgtgaacgtc atcaccacag 720 cgtgtttctc caaagtcaca aagatccttg tagctggact gaagttcttc ctgggaaaag 780 atgaggatga aaagaaagac gatgattcag aatctgagga cgagggttca acagccagag 840 atctaatgat gcgatactca actggaaaga aaacttccaa gaacaagaag aaactggaga 900 aagctatgaa agtcctgaag aaacacaaaa agaaaaaaaa agtggaggtc tttaatttct 960 ctgccataca cttgatccat gatccaaaag acttttcgga gaagctcctg aagcagctgg 1020 agagctcaaa cgagcgtttt gaggtgaaga tcatgatgat ggagctcatc tctagacttg 1080 tgggaattca tgagcttttc ctctttaact tctatccgtt tgtccagcgt ttccttcagc 1140 ctcaccagag agaggtcaca aaaattttgt tgtgtgccgc ccagtcatcc catcagcttg 1200 tcccgcctga gatcattgag ccagtcatca tgaccattgc caataacttt gttacagaca 1260 gaaattcagg ggaggtcatg acagttggaa tcaacgccat aaaggaagtg gttgcccgtt 1320 gtcctctttc tatgtcagaa gatctgcttc aggatctcac tcagtacaaa tcccacaaag 1380 ataaaaatgt tgtgatgtct gctaggggtc ttatccagct cttcagagac ctcaatccaa 1440 agatgctaac cagaaaagac aggggaagac ccacagaatc ctctaaagag gctaaaatcc 1500 acaaatatgg agaactggag gctaaagact acattcctgg agctgaagtg ctagaagcag 1560 agaaagcgga ggatgaggaa gatgaagatg ggtgggagag cgccagcatg agtgatgatg 1620 atgaagatgg agaatgggtg aatgtgcatc actcttctga tgaagaccag gcagaagtgg 1680 ctgagaaatt acagagcatc ccggaggaag agcgcaaagc taaagctgcc atggtgagca 1740 ctagtcgact ccttacgcag gatgacttta agaagatccg cgtggctcag atggcaaaag 1800 aggtgggcaa tgcgcccggg aaaggtcaga agaggaagaa cgtagacagc gatgaagaag 1860 agagaggaga gttactcagt ctaagagata ttgagcgcct tcacaagaaa cccaaatctg 1920 acaaagaaac acgattggca actgccatgg ctggacggac tgacaggaag gagtttacca 1980 aaaagagagg taaactaaac ccgtatgcca gcaccagtaa caaggagaag aaacggaaga 2040 agaacttcat gatgatgaga cacagtcaga atgttagaac taaaggcaaa cgctccttca 2100 gagagaaaca gatcgcatta agagattcac tcttgaagaa gagaaagttc aagtagttcc 2160 cacttttaat ggactggaac taatttatgc ttcgtgatct gtgtggacca ttgtgtaaag 2220 gaagacagtg aacagacttg gctgtgtggg ttttcttgtt gccttttaat aaaaacattt 2280 aaacagttaa aaaaaaaaaa a 2301 88 682 PRT Danio rerio 88 Met Ser Gly Arg His Asn Asn Lys Leu Pro Thr Asn Leu Pro Gln Leu 1 5 10 15 Gln Asn Leu Ile Lys Arg Asp Pro Lys Ser Tyr Thr Glu Glu Phe Leu 20 25 30 Gln Gln Tyr Arg His Tyr Gln Ser Asn Val Glu Ile Phe Lys His Gln 35 40 45 Pro Asp Lys Ala Asn Lys Asp Leu Ser Glu Leu Val Met Phe Leu Ala 50 55 60 Gln Val Gly His Cys Tyr Leu Glu Glu Leu Ser Asp Phe Pro Gln Gln 65 70 75 80 Leu Thr Asp Leu Leu Leu Asn Tyr His Thr Val Leu Glu Ser Asp Leu 85 90 95 Arg Met Thr Phe Cys Lys Ala Leu Ile Met Leu Arg Asn Lys Asp Leu 100 105 110 Val Ser Pro Thr Ser Leu Leu Gly Leu Phe Phe Glu Leu Leu Arg Cys 115 120 125 His Asp Lys Leu Leu Arg Lys Thr Leu Tyr Thr His Ile Val Thr Asp 130 135 140 Ile Lys Asn Ile Asn Ala Lys His Lys Asn Asn Lys Met Lys Thr Thr 145 150 155 160 Leu Gln Asn Phe Met Tyr Thr Met Leu Arg Asp Ser Asn Pro Ile Ala 165 170 175 Ala Lys Ile Ser Leu Asp Val Met Val Glu Leu Tyr Arg Arg Asn Ile 180 185 190 Trp Asn Asp Ala Lys Thr Val Asn Val Ile Thr Thr Ala Cys Phe Ser 195 200 205 Lys Val Thr Lys Ile Leu Val Ala Gly Leu Lys Phe Phe Leu Gly Lys 210 215 220 Asp Glu Asp Glu Lys Lys Asp Asp Asp Ser Glu Ser Glu Asp Glu Gly 225 230 235 240 Ser Thr Ala Arg Asp Leu Met Met Arg Tyr Ser Thr Gly Lys Lys Thr 245 250 255 Ser Lys Asn Lys Lys Lys Leu Glu Lys Ala Met Lys Val Leu Lys Lys 260 265 270 His Lys Lys Lys Lys Lys Val Glu Val Phe Asn Phe Ser Ala Ile His 275 280 285 Leu Ile His Asp Pro Lys Asp Phe Ser Glu Lys Leu Leu Lys Gln Leu 290 295 300 Glu Ser Ser Asn Glu Arg Phe Glu Val Lys Ile Met Met Met Glu Leu 305 310 315 320 Ile Ser Arg Leu Val Gly Ile His Glu Leu Phe Leu Phe Asn Phe Tyr 325 330 335 Pro Phe Val Gln Arg Phe Leu Gln Pro His Gln Arg Glu Val Thr Lys 340 345 350 Ile Leu Leu Cys Ala Ala Gln Ser Ser His Gln Leu Val Pro Pro Glu 355 360 365 Ile Ile Glu Pro Val Ile Met Thr Ile Ala Asn Asn Phe Val Thr Asp 370 375 380 Arg Asn Ser Gly Glu Val Met Thr Val Gly Ile Asn Ala Ile Lys Glu 385 390 395 400 Val Val Ala Arg Cys Pro Leu Ser Met Ser Glu Asp Leu Leu Gln Asp 405 410 415 Leu Thr Gln Tyr Lys Ser His Lys Asp Lys Asn Val Val Met Ser Ala 420 425 430 Arg Gly Leu Ile Gln Leu Phe Arg Asp Leu Asn Pro Lys Met Leu Thr 435 440 445 Arg Lys Asp Arg Gly Arg Pro Thr Glu Ser Ser Lys Glu Ala Lys Ile 450 455 460 His Lys Tyr Gly Glu Leu Glu Ala Lys Asp Tyr Ile Pro Gly Ala Glu 465 470 475 480 Val Leu Glu Ala Glu Lys Ala Glu Asp Glu Glu Asp Glu Asp Gly Trp 485 490 495 Glu Ser Ala Ser Met Ser Asp Asp Asp Glu Asp Gly Glu Trp Val Asn 500 505 510 Val His His Ser Ser Asp Glu Asp Gln Ala Glu Val Ala Glu Lys Leu 515 520 525 Gln Ser Ile Pro Glu Glu Glu Arg Lys Ala Lys Ala Ala Met Val Ser 530 535 540 Thr Ser Arg Leu Leu Thr Gln Asp Asp Phe Lys Lys Ile Arg Val Ala 545 550 555 560 Gln Met Ala Lys Glu Val Gly Asn Ala Pro Gly Lys Gly Gln Lys Arg 565 570 575 Lys Asn Val Asp Ser Asp Glu Glu Glu Arg Gly Glu Leu Leu Ser Leu 580 585 590 Arg Asp Ile Glu Arg Leu His Lys Lys Pro Lys Ser Asp Lys Glu Thr 595 600 605 Arg Leu Ala Thr Ala Met Ala Gly Arg Thr Asp Arg Lys Glu Phe Thr 610 615 620 Lys Lys Arg Gly Lys Leu Asn Pro Tyr Ala Ser Thr Ser Asn Lys Glu 625 630 635 640 Lys Lys Arg Lys Lys Asn Phe Met Met Met Arg His Ser Gln Asn Val 645 650 655 Arg Thr Lys Gly Lys Arg Ser Phe Arg Glu Lys Gln Ile Ala Leu Arg 660 665 670 Asp Ser Leu Leu Lys Lys Arg Lys Phe Lys 675 680 89 1630 DNA Danio rerio 89 gaattcggca cgaggtccgg taccggtgat ctaaccacgc cgcaaatatg atgggccgac 60 cggttctcgt gcttagtcag aacatcaaga gagaatctgg acggaaggtc cagattggaa 120 acatctctgc ggctaagacc atagcagaca tcatcaggac atgcctagga ccaagggcta 180 tgatgaagat gctgctggat cccatgggtg gtatcgtcat gaccaatgat ggcaatgcca 240 ttttgagaga gatccaggtc cagcatcctg ctgccaagtc catgattgag attagccgca 300 cacaggatga agaagtcgga gatggtacca cctctgtcat cattcttgcg ggtgagatgc 360 tgtcagtggc tgagcagttt ttggagcagc agatgcatcc tactgtggtg atcggtgcat 420 acagacaagc tctggatgac atgttgaaca tactcaaaga cataagcact ccagtagatg 480 tgagtaatcg agacatgatg ttgaagataa taaactctgc aatcaacacc aaagcactca 540 gccgctggtc cacattggcc tgtaacatcg ctcttgatgc tgtccgtact gttgagctgg 600 acgagaatgg acgcgaagag attgacatta agaagtatgc caaagttgaa aaggttcctg 660 gtggcatcat cgaagactca tgcgtgctga ggggtgtgat ggtgaataag gatgtaactc 720 atccccgcat gcgcagactc atcaagaacc ccagaattgt tctgctggac tgttctctgg 780 agtacaagaa gggcgagagc cagactgata ttgagattgc tcgtgaggaa gacttcgccc 840 gcattctgca gatggaggaa gaatacgttc agcagatctg tgaagacatc attcggctca 900 aacctgacct aatcttcaca gagaagggca tttctgatct ggctcagcac tatctgatga 960 aagcaaacat cactgctatc cgccgtatca ggaagacgga caacaaccgc attgcaagag 1020 cgtgtggtgc acgtattgcc agcagaaccg atgaactcac tgaaaatgat gtgggtacag 1080 gaacaggcct ttttgaagtc aagaagatcg gcgacgagta cttcaccttt gtgacagagt 1140 gtaaagaccc caaagcatgt accatcctgc tcagaggagc cagcaaggaa atcttggcgg 1200 aagtggaacg aaacctacag gatgccatgc aagtgtgtcg taatgtcctg ctggacccat 1260 atctgctgcc gggtggtggc gcagtggaga tggaggtgtc ccatcggctg acggagcgct 1320 ctcgagctat gactggtgtg gagcagtggc cgtaccgcgc tgtagctcag gctctagagg 1380 ttgtccctcg cacactcatc cagaactgtg gagcttcagc cattcgtgtg ctcacttctc 1440 tgagggccaa gcacactcag gagggaaact cctcatgggg tgttaatgga gagacaggaa 1500 ctcttgctga tatggagcag ctgggaatct gggaaccgct ggctgttaaa gcccaaactt 1560 acaaaactgc agtggagacg gctatcctgt tactgcgcat tgatgacatc gtttctggcc 1620 acaagaagaa 1630 90 526 PRT Danio rerio 90 Met Gly Arg Pro Val Leu Val Leu Ser Gln Asn Ile Lys Arg Glu Ser 1 5 10 15 Gly Arg Lys Val Gln Ile Gly Asn Ile Ser Ala Ala Lys Thr Ile Ala 20 25 30 Asp Ile Ile Arg Thr Cys Leu Gly Pro Arg Ala Met Met Lys Met Leu 35 40 45 Leu Asp Pro Met Gly Gly Ile Val Met Thr Asn Asp Gly Asn Ala Ile 50 55 60 Leu Arg Glu Ile Gln Val Gln His Pro Ala Ala Lys Ser Met Ile Glu 65 70 75 80 Ile Ser Arg Thr Gln Asp Glu Glu Val Gly Asp Gly Thr Thr Ser Val 85 90 95 Ile Ile Leu Ala Gly Glu Met Leu Ser Val Ala Glu Gln Phe Leu Glu 100 105 110 Gln Gln Met His Pro Thr Val Val Ile Gly Ala Tyr Arg Gln Ala Leu 115 120 125 Asp Asp Met Leu Asn Ile Leu Lys Asp Ile Ser Thr Pro Val Asp Val 130 135 140 Ser Asn Arg Asp Met Met Leu Lys Ile Ile Asn Ser Ala Ile Asn Thr 145 150 155 160 Lys Ala Leu Ser Arg Trp Ser Thr Leu Ala Cys Asn Ile Ala Leu Asp 165 170 175 Ala Val Arg Thr Val Glu Leu Asp Glu Asn Gly Arg Glu Glu Ile Asp 180 185 190 Ile Lys Lys Tyr Ala Lys Val Glu Lys Val Pro Gly Gly Ile Ile Glu 195 200 205 Asp Ser Cys Val Leu Arg Gly Val Met Val Asn Lys Asp Val Thr His 210 215 220 Pro Arg Met Arg Arg Leu Ile Lys Asn Pro Arg Ile Val Leu Leu Asp 225 230 235 240 Cys Ser Leu Glu Tyr Lys Lys Gly Glu Ser Gln Thr Asp Ile Glu Ile 245 250 255 Ala Arg Glu Glu Asp Phe Ala Arg Ile Leu Gln Met Glu Glu Glu Tyr 260 265 270 Val Gln Gln Ile Cys Glu Asp Ile Ile Arg Leu Lys Pro Asp Leu Ile 275 280 285 Phe Thr Glu Lys Gly Ile Ser Asp Leu Ala Gln His Tyr Leu Met Lys 290 295 300 Ala Asn Ile Thr Ala Ile Arg Arg Ile Arg Lys Thr Asp Asn Asn Arg 305 310 315 320 Ile Ala Arg Ala Cys Gly Ala Arg Ile Ala Ser Arg Thr Asp Glu Leu 325 330 335 Thr Glu Asn Asp Val Gly Thr Gly Thr Gly Leu Phe Glu Val Lys Lys 340 345 350 Ile Gly Asp Glu Tyr Phe Thr Phe Val Thr Glu Cys Lys Asp Pro Lys 355 360 365 Ala Cys Thr Ile Leu Leu Arg Gly Ala Ser Lys Glu Ile Leu Ala Glu 370 375 380 Val Glu Arg Asn Leu Gln Asp Ala Met Gln Val Cys Arg Asn Val Leu 385 390 395 400 Leu Asp Pro Tyr Leu Leu Pro Gly Gly Gly Ala Val Glu Met Glu Val 405 410 415 Ser His Arg Leu Thr Glu Arg Ser Arg Ala Met Thr Gly Val Glu Gln 420 425 430 Trp Pro Tyr Arg Ala Val Ala Gln Ala Leu Glu Val Val Pro Arg Thr 435 440 445 Leu Ile Gln Asn Cys Gly Ala Ser Ala Ile Arg Val Leu Thr Ser Leu 450 455 460 Arg Ala Lys His Thr Gln Glu Gly Asn Ser Ser Trp Gly Val Asn Gly 465 470 475 480 Glu Thr Gly Thr Leu Ala Asp Met Glu Gln Leu Gly Ile Trp Glu Pro 485 490 495 Leu Ala Val Lys Ala Gln Thr Tyr Lys Thr Ala Val Glu Thr Ala Ile 500 505 510 Leu Leu Leu Arg Ile Asp Asp Ile Val Ser Gly His Lys Lys 515 520 525 91 630 DNA Danio rerio 91 gttttacgct acaatttttt ctcatctagg tggaggattt tctctcaaag tccaccgcca 60 tcatgaaact cgtcaggttt ttgatgaaat tgagccatga aaccgtcacc attgagctga 120 agaatggcac acaggtccat ggcacgatta caggtgtgga cgtcagcatg aacacacacc 180 tcaaagctgt gaagatgacc ctgaaaaaca gagagcccac tcagctggag tcgctcagca 240 tccgaggaaa caacatccgc tacttcatcc tgccggacag tcttcctctg gacacgctgc 300 tggtggacat cgagcccaaa gtcaagtcca agaagagaga agctgtggct ggccgtggtc 360 gaggaagagg acgtggtcgt ggccgaggaa gaggacgggg ccgcggtggg ccgaggagat 420 gaagatctca agccactgta cttttgtgcg cgttgtacag cattttttgt tcttcgtata 480 ggagatattt tgcgttttgt acattaaatt ctgttctttt gacaactgtt tcgagtcttt 540 tgttttgtgg tgatcatatg agggatgatt ctgcaagggt gtctggttat ttttctacat 600 tcttaccaat aaaaactgct tgattgaaaa 630 92 119 PRT Danio rerio 92 Met Lys Leu Val Arg Phe Leu Met Lys Leu Ser His Glu Thr Val Thr 1 5 10 15 Ile Glu Leu Lys Asn Gly Thr Gln Val His Gly Thr Ile Thr Gly Val 20 25 30 Asp Val Ser Met Asn Thr His Leu Lys Ala Val Lys Met Thr Leu Lys 35 40 45 Asn Arg Glu Pro Thr Gln Leu Glu Ser Leu Ser Ile Arg Gly Asn Asn 50 55 60 Ile Arg Tyr Phe Ile Leu Pro Asp Ser Leu Pro Leu Asp Thr Leu Leu 65 70 75 80 Val Asp Ile Glu Pro Lys Val Lys Ser Lys Lys Arg Glu Ala Val Ala 85 90 95 Gly Arg Gly Arg Gly Arg Gly Arg Gly Arg Gly Arg Gly Arg Gly Arg 100 105 110 Gly Arg Gly Gly Pro Arg Arg 115 93 1773 DNA Danio rerio 93 ggaagtgcta atttggcggg agcagttgaa gatggatgcg cgcagagtga agcagaaagt 60 gaacgcgggc tttaaaatgc gtggattaat cctgcgaccc gagtccagcc ggttcctgct 120 gcaggttctg gagtccgtca gtccggcaga gctggaggag gttctggagc gggttctaga 180 ctctgtggag aaacagccac tgtcctccag tatggtggat ctgtgtgtgc tggaagccgc 240 tgtgcaggac tgcagtcagt cgtgtgacga gaccattgat catgtcttca acatcattgg 300 agcgtttgat gttcctcgct tcatcttcag cacagagagg aagaagtttg tgccgatcag 360 catgaccaat cacccgctgc ctaaactctg cggtcaggcc cgagataaag ctgagctgtt 420 cagagagaga tacaccatcc tgcagcagcg aacgcatcgg cacgagctct tcactccgcc 480 tgtcatcggt tcagctcagg atgaaggaag aaacaagttc cagctgaaga cagtggaggc 540 tcttttgtgc agcagcgctc ggctcgggga ggtgattgtt ctgggcatga tcacgcagct 600 ccgagagggt aagttctacc tggaggaccc cagtggtact gtgcagctcg acatctcaaa 660 ggcacagttc cacagtgggc tgtacactga gagctgcttc gtgctagcgg agggctggta 720 tgaggactct gtgttccatg tcagtgcgtt cggcttcccg cctacagaac cctcgtcctt 780 caccagggcg tatttcggca atgtgaactt cttcggcggg ccgtccagca cagcggtgaa 840 agcgtccgct aaactgaagc agctggagga ggagaacgag gacgccatgt ttgtgctggt 900 gtcagacgtg tggctggaca gtgtggaggt gctggagaag atacacacca tgttctctgg 960 atactccgct ctgccgccca cctgcttcat cttctgtggg agtttctcct cggctccgta 1020 tggcaggact cagctgcgct cgctcagaga ctcctttaaa gcgctcgctg acctcatctg 1080 tgagtttccc agcattcaca gcagcagtcg gtttgtgttt gttcctggtc cagaagaccc 1140 tggacccggc gcggttttac ccaggcctcc actggcggag cacatcacag aggagtttca 1200 gcagcgtgtg ccgctctctg tcttcaccac caacccctgc aggatccagt actgcagtca 1260 ggagctggtg attattcggg aggatctggt caataagatg tgccggaact gtgagcgtct 1320 gccgagcagc agcctggaca tccctacaca ctttgtgaag accgtgctgt ctcagggtca 1380 cctgacgccg ctgccgctgt acgtgtgtcc ggtgttctgg gcgtacgatc acacactccg 1440 cctgtatcct gtaccagacg tcatcgtgtt tgcagacaaa tacgacccgt tcaacatctg 1500 caacaccgac tgcctgtgca tcaacccggg ctccttcccg agaagcggat tctccttcaa 1560 ggtgtattac ccgtccagca ggactgtgga ggacagtaag ctccagggcc tgtgacccag 1620 tggaacgccc ccccgcctca tttctgttgc cctctgtaaa tgtggatgtt ttaggttgtg 1680 tttcttcccg tgacgtttga ttttatgccc ttttaaagtc caataaaaaa ataaaaattt 1740 gagacgtaaa aaaaaaaaaa aaaaaaaaaa aaa 1773 94 527 PRT Danio rerio 94 Met Asp Ala Arg Arg Val Lys Gln Lys Val Asn Ala Gly Phe Lys Met 1 5 10 15 Arg Gly Leu Ile Leu Arg Pro Glu Ser Ser Arg Phe Leu Leu Gln Val 20 25 30 Leu Glu Ser Val Ser Pro Ala Glu Leu Glu Glu Val Leu Glu Arg Val 35 40 45 Leu Asp Ser Val Glu Lys Gln Pro Leu Ser Ser Ser Met Val Asp Leu 50 55 60 Cys Val Leu Glu Ala Ala Val Gln Asp Cys Ser Gln Ser Cys Asp Glu 65 70 75 80 Thr Ile Asp His Val Phe Asn Ile Ile Gly Ala Phe Asp Val Pro Arg 85 90 95 Phe Ile Phe Ser Thr Glu Arg Lys Lys Phe Val Pro Ile Ser Met Thr 100 105 110 Asn His Pro Leu Pro Lys Leu Cys Gly Gln Ala Arg Asp Lys Ala Glu 115 120 125 Leu Phe Arg Glu Arg Tyr Thr Ile Leu Gln Gln Arg Thr His Arg His 130 135 140 Glu Leu Phe Thr Pro Pro Val Ile Gly Ser Ala Gln Asp Glu Gly Arg 145 150 155 160 Asn Lys Phe Gln Leu Lys Thr Val Glu Ala Leu Leu Cys Ser Ser Ala 165 170 175 Arg Leu Gly Glu Val Ile Val Leu Gly Met Ile Thr Gln Leu Arg Glu 180 185 190 Gly Lys Phe Tyr Leu Glu Asp Pro Ser Gly Thr Val Gln Leu Asp Ile 195 200 205 Ser Lys Ala Gln Phe His Ser Gly Leu Tyr Thr Glu Ser Cys Phe Val 210 215 220 Leu Ala Glu Gly Trp Tyr Glu Asp Ser Val Phe His Val Ser Ala Phe 225 230 235 240 Gly Phe Pro Pro Thr Glu Pro Ser Ser Phe Thr Arg Ala Tyr Phe Gly 245 250 255 Asn Val Asn Phe Phe Gly Gly Pro Ser Ser Thr Ala Val Lys Ala Ser 260 265 270 Ala Lys Leu Lys Gln Leu Glu Glu Glu Asn Glu Asp Ala Met Phe Val 275 280 285 Leu Val Ser Asp Val Trp Leu Asp Ser Val Glu Val Leu Glu Lys Ile 290 295 300 His Thr Met Phe Ser Gly Tyr Ser Ala Leu Pro Pro Thr Cys Phe Ile 305 310 315 320 Phe Cys Gly Ser Phe Ser Ser Ala Pro Tyr Gly Arg Thr Gln Leu Arg 325 330 335 Ser Leu Arg Asp Ser Phe Lys Ala Leu Ala Asp Leu Ile Cys Glu Phe 340 345 350 Pro Ser Ile His Ser Ser Ser Arg Phe Val Phe Val Pro Gly Pro Glu 355 360 365 Asp Pro Gly Pro Gly Ala Val Leu Pro Arg Pro Pro Leu Ala Glu His 370 375 380 Ile Thr Glu Glu Phe Gln Gln Arg Val Pro Leu Ser Val Phe Thr Thr 385 390 395 400 Asn Pro Cys Arg Ile Gln Tyr Cys Ser Gln Glu Leu Val Ile Ile Arg 405 410 415 Glu Asp Leu Val Asn Lys Met Cys Arg Asn Cys Glu Arg Leu Pro Ser 420 425 430 Ser Ser Leu Asp Ile Pro Thr His Phe Val Lys Thr Val Leu Ser Gln 435 440 445 Gly His Leu Thr Pro Leu Pro Leu Tyr Val Cys Pro Val Phe Trp Ala 450 455 460 Tyr Asp His Thr Leu Arg Leu Tyr Pro Val Pro Asp Val Ile Val Phe 465 470 475 480 Ala Asp Lys Tyr Asp Pro Phe Asn Ile Cys Asn Thr Asp Cys Leu Cys 485 490 495 Ile Asn Pro Gly Ser Phe Pro Arg Ser Gly Phe Ser Phe Lys Val Tyr 500 505 510 Tyr Pro Ser Ser Arg Thr Val Glu Asp Ser Lys Leu Gln Gly Leu 515 520 525 95 2646 DNA Danio rerio 95 cgggggcaga cggacgcacg cggaacggta agatggcggc ggacggaggc cgagctgcgt 60 taccctgcgc cttcagccct aaatcacatc agtttttggc tctctgcgcg caagatggga 120 gactgcgaat ctggaatacc gagagcaaaa ccctacaaca ggaatacgtg ccttccgcgc 180 atctgagcgc agcctgtacg tgtgtgacat ggggtccgtg ccgagcagtt caggaggtgc 240 ctcagaggaa gaggaggaaa tgtgatgcgg cttcatcagc tgagcagtca gatctgttgg 300 cactgggcac tgctgctggc accatcctca tctacagcac actgaaggga gacctgcatt 360 gtacattgga tggaggacac agtgggcctg tgaacagcgt ccagtggcat ccagaagact 420 ctgtgttata cagcggttca gatgacacac acatagccga gtgggacttg aaaacgggga 480 aagtttgctg caaatggaag gcagacagat ccgctatcag cagcctgtgt ataagtccag 540 atggaaagat gctgctctct gcaggaatga ccatcaagat gtgggacctg gagactaagg 600 aagtgtacag gaaattcaca ggtcactcta ccatggttac gactctatgc tttgccacaa 660 cacgacctcc agacagcaat gggatgtatt tcctttcagg agccgtacat gacagactcc 720 tgagtgtgtg gcaggtacgg tccgatggca aagacaagaa ctctgtggtg tcttttactc 780 tgacagatga accccaacac atagaccttc agacatccaa cagcaaagat gaggctgtgc 840 ggttagcagt ggtttgtaag gatgggcagc ttcatctttt tgagcacttt ttgaatgggc 900 agtgtaagaa gccactgtct ccagcgtgtt cccttcaggt gtccactatg aagggagagt 960 ctccagtgcc tgtgcctcta ctggcagctg ctctgtgtgc ggacagacag aacctgatgc 1020 tggcatatgg acatcacctg cagcccgtca tggagaaaac accttttaac acatcaaaaa 1080 agcacacatg cttgaacccg agatgtacat gccacacttt ctctttctgt agacacgact 1140 gtctccaagg tgaagacacc agttgtaaat cagcaaagca aagttttaat cccaggagtt 1200 ccaggacata aggcaccaat taaattaact cccaatgaag gagagaaaag gaaaaaaggg 1260 gccagtacta cagaggtgtc tatcgaggaa cggttggagc agattgagtt gagtgctgga 1320 agaaaaggtg ctgataaagg ctcgtcctcc ctgcagacag acagtttcgc tgtgctgctt 1380 gtccagggct tagagagcaa agatgagaag atccttaata aaatatttca aaccaaaaaa 1440 gagacgttga tcaagaacac tgttgctcgg ttaccacctc ctgctgtcct gccattgctg 1500 gaagaactct ccagaagact gcagggacat ccatacgcgg ctttgcttat agttcattgg 1560 tttaaggctg tacttacgca acacacatca tacttgtctt cattgccaga tctagtgtct 1620 cagttgggct caatctacca aatgatcgag agcagggtga agttatatca tcagatcacc 1680 agactccatg gaaaactcta cctgctcatg acacaggtgg cgacaagcaa cagtgcaaca 1740 aaagttggtg aaattgacca cacagcaaag ctggtatatg aagaagagtc ttcagaggat 1800 gaagattcag gagatgaagg tcgtcctgag gatgactctg ataactggga ggaggatgag 1860 gaggctatgg aggtgactgg agacaaaaag aaccacaaag atgaggaaga ccaagacatg 1920 aatgaggaat caaaagcaaa tggagactca gatctggacc ctgaaaatga gagtgaggag 1980 gaatagtact caatcaggct gcaggattca gataaaagga cccttacatt tatatatatg 2040 attttgtttt acttttcatt ttattgtaac aaaacaaaaa ggcgaagttt tatagagttt 2100 cgataatcgt tgcggcctgg atctgtggtt gtcctttttt gaactgttat gaccagactg 2160 acagcccaaa caaacacatt cgaggtccat ttcagtcgct tttcccttga tttggaaacg 2220 gcctctgttt tgttgcccgg gttttatggg attctctggg gttcccaaaa tctccatctc 2280 cctgcaagat acattcttaa tgggcgtccc atgtccataa gatcgatccc attatatgac 2340 atgatcatga ttctgctcat catcagtgtc catctttaat ttatctgtgg tcattacact 2400 cgctgtcaca gtcatttctc atgtctgact tctgtgtcct gtgtgtaatt tgagaagaag 2460 attattacac cctctgcagt catgactgtt ctcagttttg aggacaaaac aatgtgaagc 2520 tttcccagaa aaaacgaata tcctaaaaac agtcacagag gtgtgtatga taactataca 2580 gtggttacag ccaaccagaa cttttcatgc aaacaatata tcaacaactg ttgtacagtt 2640 taaatg 2646 96 650 PRT Danio rerio 96 Met Ala Ala Asp Gly Gly Arg Ala Ala Leu Pro Cys Ala Phe Ser Pro 1 5 10 15 Lys Ser His Gln Phe Leu Ala Leu Cys Ala Gln Asp Gly Arg Leu Arg 20 25 30 Ile Trp Asn Thr Glu Ser Lys Thr Leu Gln Gln Glu Tyr Val Pro Ser 35 40 45 Ala His Leu Ser Ala Ala Cys Thr Cys Val Thr Trp Gly Pro Cys Arg 50 55 60 Ala Val Gln Glu Val Pro Gln Arg Lys Arg Arg Lys Cys Asp Ala Ala 65 70 75 80 Ser Ser Ala Glu Gln Ser Asp Leu Leu Ala Leu Gly Thr Ala Ala Gly 85 90 95 Thr Ile Leu Ile Tyr Ser Thr Leu Lys Gly Asp Leu His Cys Thr Leu 100 105 110 Asp Gly Gly His Ser Gly Pro Val Asn Ser Val Gln Trp His Pro Glu 115 120 125 Asp Ser Val Leu Tyr Ser Gly Ser Asp Asp Thr His Ile Ala Glu Trp 130 135 140 Asp Leu Lys Thr Gly Lys Val Cys Cys Lys Trp Lys Ala Asp Arg Ser 145 150 155 160 Ala Ile Ser Ser Leu Cys Ile Ser Pro Asp Gly Lys Met Leu Leu Ser 165 170 175 Ala Gly Met Thr Ile Lys Met Trp Asp Leu Glu Thr Lys Glu Val Tyr 180 185 190 Arg Lys Phe Thr Gly His Ser Thr Met Val Thr Thr Leu Cys Phe Ala 195 200 205 Thr Thr Arg Pro Pro Asp Ser Asn Gly Met Tyr Phe Leu Ser Gly Ala 210 215 220 Val His Asp Arg Leu Leu Ser Val Trp Gln Val Arg Ser Asp Gly Lys 225 230 235 240 Asp Lys Asn Ser Val Val Ser Phe Thr Leu Thr Asp Glu Pro Gln His 245 250 255 Ile Asp Leu Gln Thr Ser Asn Ser Lys Asp Glu Ala Val Arg Leu Ala 260 265 270 Val Val Cys Lys Asp Gly Gln Leu His Leu Phe Glu His Phe Leu Asn 275 280 285 Gly Gln Cys Lys Lys Pro Leu Ser Pro Ala Cys Ser Leu Gln Val Ser 290 295 300 Thr Met Lys Gly Glu Ser Pro Val Pro Val Pro Leu Leu Ala Ala Ala 305 310 315 320 Leu Cys Ala Asp Arg Gln Asn Leu Met Leu Ala Tyr Gly His His Leu 325 330 335 Gln Pro Val Met Glu Lys Thr Pro Phe Asn Thr Ser Lys Lys His Thr 340 345 350 Cys Leu Asn Pro Arg Cys Thr Cys His Thr Phe Ser Phe Cys Arg His 355 360 365 Asp Cys Leu Gln Gly Glu Asp Thr Ser Cys Lys Ser Ala Lys Gln Ser 370 375 380 Phe Asn Pro Arg Ser Ser Arg Thr Lys Ala Pro Ile Lys Leu Thr Pro 385 390 395 400 Asn Glu Gly Glu Lys Arg Lys Lys Gly Ala Ser Thr Thr Glu Val Ser 405 410 415 Ile Glu Glu Arg Leu Glu Gln Ile Glu Leu Ser Ala Gly Arg Lys Gly 420 425 430 Ala Asp Lys Gly Ser Ser Ser Leu Gln Thr Asp Ser Phe Ala Val Leu 435 440 445 Leu Val Gln Gly Leu Glu Ser Lys Asp Glu Lys Ile Leu Asn Lys Ile 450 455 460 Phe Gln Thr Lys Lys Glu Thr Leu Ile Lys Asn Thr Val Ala Arg Leu 465 470 475 480 Pro Pro Pro Ala Val Leu Pro Leu Leu Glu Glu Leu Ser Arg Arg Leu 485 490 495 Gln Gly His Pro Tyr Ala Ala Leu Leu Ile Val His Trp Phe Lys Ala 500 505 510 Val Leu Thr Gln His Thr Ser Tyr Leu Ser Ser Leu Pro Asp Leu Val 515 520 525 Ser Gln Leu Gly Ser Ile Tyr Gln Met Ile Glu Ser Arg Val Lys Leu 530 535 540 Tyr His Gln Ile Thr Arg Leu His Gly Lys Leu Tyr Leu Leu Met Thr 545 550 555 560 Gln Val Ala Thr Ser Asn Ser Ala Thr Lys Val Gly Glu Ile Asp His 565 570 575 Thr Ala Lys Leu Val Tyr Glu Glu Glu Ser Ser Glu Asp Glu Asp Ser 580 585 590 Gly Asp Glu Gly Arg Pro Glu Asp Asp Ser Asp Asn Trp Glu Glu Asp 595 600 605 Glu Glu Ala Met Glu Val Thr Gly Asp Lys Lys Asn His Lys Asp Glu 610 615 620 Glu Asp Gln Asp Met Asn Glu Glu Ser Lys Ala Asn Gly Asp Ser Asp 625 630 635 640 Leu Asp Pro Glu Asn Glu Ser Glu Glu Glu 645 650 97 1197 DNA Danio rerio 97 aaacacacac gcacgggggg aacattggat taaacagccg ccatgcagaa taaagaaaac 60 cgggaaccca gagttcaaca aacaccgtct gctggtgtgg gtccgctcag agttgagatg 120 aatccagaca cacacgcagt ctcaggtcct ggcagagttc ctgtcaagtc aaactccaaa 180 gtgctgtcca tcgatgactt tgacattggc cgtcctctgg gaaagggtaa atttgggaac 240 gtgtatctgg cgcgcgagcg gaagctgaag gtggtgatcg cgctgaaggt gctcttcaag 300 tctcagatgg taaaagaggg agtggagcat cagctgcgca gggagatcga gatacagtca 360 cacctcaggc atcccaacat ccttcgcttc tacaactact tccacgacga cacccgtgtg 420 tttctgattt tggagtatgc gccgcgtggt gagatgtaca aagagcttca gcgatacgga 480 cactttgacg atcagcgcac tgctacttac atggaggagg tgtcggacgc gctgcagtac 540 tgccatgaga agaaggtgat ccacagagac attaaaccag aaaacctgct gctgggatac 600 agaggagaac tgaagatcgc agacttcggc tggtctgtgc acgcgccctc actcagacgg 660 cggacgatgt gcggtacgct ggattacctg ccgccagaga tgatcgaggg ccacagtcac 720 gatgagaagg tggatctgtg gtccatcggt gtgctgtgct acgagtgttt agtgggaaat 780 cctccattcg agactcgcca gcacgccgag acatacaagc gcatcactaa ggtggatctg 840 cagttcccga agctggtgtc tgagggcgcg cgggacctga tctccaagct gctgcgccac 900 agtccctcca tgcgcctgcc gctgcgcagt gtgatggagc accgtggggt gaaggccaac 960 tcccgcaggg tgctcccgcc cgtctgcagc tccgacccac actgaccccc caggacctga 1020 ttgaggcctg gcatatacac ccacccgact caatctgtgt gtgtgtgtgt gtgtgcgtca 1080 gagcaaagca tctccactgt atcacacctg ttcttctgtt cacatccatg ttttaatctt 1140 gtctgtcttg tttaatcctg tcaaatatat aaagctgttt aaataaaaaa aaaaaaa 1197 98 320 PRT Danio rerio 98 Met Gln Asn Lys Glu Asn Arg Glu Pro Arg Val Gln Gln Thr Pro Ser 1 5 10 15 Ala Gly Val Gly Pro Leu Arg Val Glu Met Asn Pro Asp Thr His Ala 20 25 30 Val Ser Gly Pro Gly Arg Val Pro Val Lys Ser Asn Ser Lys Val Leu 35 40 45 Ser Ile Asp Asp Phe Asp Ile Gly Arg Pro Leu Gly Lys Gly Lys Phe 50 55 60 Gly Asn Val Tyr Leu Ala Arg Glu Arg Lys Leu Lys Val Val Ile Ala 65 70 75 80 Leu Lys Val Leu Phe Lys Ser Gln Met Val Lys Glu Gly Val Glu His 85 90 95 Gln Leu Arg Arg Glu Ile Glu Ile Gln Ser His Leu Arg His Pro Asn 100 105 110 Ile Leu Arg Phe Tyr Asn Tyr Phe His Asp Asp Thr Arg Val Phe Leu 115 120 125 Ile Leu Glu Tyr Ala Pro Arg Gly Glu Met Tyr Lys Glu Leu Gln Arg 130 135 140 Tyr Gly His Phe Asp Asp Gln Arg Thr Ala Thr Tyr Met Glu Glu Val 145 150 155 160 Ser Asp Ala Leu Gln Tyr Cys His Glu Lys Lys Val Ile His Arg Asp 165 170 175 Ile Lys Pro Glu Asn Leu Leu Leu Gly Tyr Arg Gly Glu Leu Lys Ile 180 185 190 Ala Asp Phe Gly Trp Ser Val His Ala Pro Ser Leu Arg Arg Arg Thr 195 200 205 Met Cys Gly Thr Leu Asp Tyr Leu Pro Pro Glu Met Ile Glu Gly His 210 215 220 Ser His Asp Glu Lys Val Asp Leu Trp Ser Ile Gly Val Leu Cys Tyr 225 230 235 240 Glu Cys Leu Val Gly Asn Pro Pro Phe Glu Thr Arg Gln His Ala Glu 245 250 255 Thr Tyr Lys Arg Ile Thr Lys Val Asp Leu Gln Phe Pro Lys Leu Val 260 265 270 Ser Glu Gly Ala Arg Asp Leu Ile Ser Lys Leu Leu Arg His Ser Pro 275 280 285 Ser Met Arg Leu Pro Leu Arg Ser Val Met Glu His Arg Gly Val Lys 290 295 300 Ala Asn Ser Arg Arg Val Leu Pro Pro Val Cys Ser Ser Asp Pro His 305 310 315 320 99 2007 DNA Danio rerio 99 tggctgccat ctgtaacttc catgtgctgt tttattatat cacaaaataa ctgctctcat 60 catttttctt atccgtaaag ttaatttcat tgtattggat taacactgac atttacgtac 120 agagagccgg ttgttcaaag aagccgtcag aatgcaacac gacgacgtta tatgggacct 180 tatcggcaat aagagttttt gttcctataa agttaaaacg aaaactcagc agttctggcg 240 aaatgaatac aacatcacag gcctttgcaa cagatcatca tgccctcttg ctaatagcca 300 gtatgcaact ataagagaag aaaaaggcca gtgttttctc tacatgaaag ttattgagag 360 agcagcattt ccttctcgca tgtgggagaa ggtgaaactt gacagaaact atgcaaaggc 420 tctggaacag atcgacgaga acctcatcta ctggccacgg tttatccgtc acaagtgtaa 480 acagcgattg accaagatca cccagtatct cattcgaatc cgcaagctca cgctaaaacg 540 acagagaaaa ctggttccac tcagtagaaa agtagagaga cgagaaaaga ggagagagga 600 aaaagctctt attgccgctc agctggaaaa tgccatcgaa aaagagctgc tagatcggct 660 aaaacaggga acatatggag acatttacaa cttccccatc aatgccttcg acaaagctat 720 ggaaaaacag gatgaagaaa gtgaatctga ggaggaggaa gaagaggaag acgaagagga 780 gtctggaacg agagaattta ttgctgagga tgagtttgaa gagagtgatc tcagtgattt 840 tgaggaattg aacaacctga aggacagcag tgacgaagaa gtgaacgatg aagaggaatc 900 cagtgaggag tcagaggaag agtctgagga ggaggaggtg aagagcaagg ccaaatctaa 960 aggaaaagct ccactcaaag gcccaagcag gaaaaacgag cctatgtgga gatcgagtat 1020 gaacaggaga cagaaccagc tcagaaaagc aaggccacat agaggctctg tggaacgttt 1080 ccaaatcatg gaatgtgatc atttaccaga cacttggatt cagttagaac gtacactggt 1140 gtgttcaaac actcccatta aaagctttct tgtgggaatc tttttttttt tttttttttt 1200 acattgaatt atgctgatgc caagcataag tgctgtttta agtatacatt tggattaact 1260 tgatccatgt aaaaaataat aaaattaagt tcatatgaca tgcagggttt accaagtagt 1320 ggctggtgtt gtaaaagggt gaacatgtta aatatgtttt gtatatgaat acatggattt 1380 gtattgacgt taacagttga taatcaagct gcaaggagta aaattatctt tagtaaaaat 1440 agttattgaa atgaatctat atacacacta gtatcaatgc agagataaat gggatagagg 1500 aacggtaaat gttttatttt tggtcggatg tctttttaat caggagccta ctgtcttaac 1560 catagcttac ctcatttaat aataatgtaa acaacaacaa aggtcatatt ttaggtttta 1620 tgcttattaa attgattgga agaatctcaa gcaatttaaa caaaaaaaaa caaatcaggt 1680 ttatcaactt tatttaaaaa acaacaacaa ccctgccacg tttagtacct gcattcatag 1740 aatggagtaa atagattccc gttctatcaa gccaacttca cttcattcga gaggaaaagt 1800 gacatatggt caaagcttac ggacaagatg ctgcaaggac attttggata cacttgaaga 1860 ctgggtcagg gacacaactg ctgtctacct ctcttagaag agcctggggg gggggaaact 1920 ggaaatgtta taatggcgtt aaacagtgtg cagaaacaag atcattttac agtcattgaa 1980 taaaacactg gcttgtaaat ctgaaaa 2007 100 351 PRT Danio rerio 100 Met Gln His Asp Asp Val Ile Trp Asp Leu Ile Gly Asn Lys Ser Phe 1 5 10 15 Cys Ser Tyr Lys Val Lys Thr Lys Thr Gln Gln Phe Trp Arg Asn Glu 20 25 30 Tyr Asn Ile Thr Gly Leu Cys Asn Arg Ser Ser Cys Pro Leu Ala Asn 35 40 45 Ser Gln Tyr Ala Thr Ile Arg Glu Glu Lys Gly Gln Cys Phe Leu Tyr 50 55 60 Met Lys Val Ile Glu Arg Ala Ala Phe Pro Ser Arg Met Trp Glu Lys 65 70 75 80 Val Lys Leu Asp Arg Asn Tyr Ala Lys Ala Leu Glu Gln Ile Asp Glu 85 90 95 Asn Leu Ile Tyr Trp Pro Arg Phe Ile Arg His Lys Cys Lys Gln Arg 100 105 110 Leu Thr Lys Ile Thr Gln Tyr Leu Ile Arg Ile Arg Lys Leu Thr Leu 115 120 125 Lys Arg Gln Arg Lys Leu Val Pro Leu Ser Arg Lys Val Glu Arg Arg 130 135 140 Glu Lys Arg Arg Glu Glu Lys Ala Leu Ile Ala Ala Gln Leu Glu Asn 145 150 155 160 Ala Ile Glu Lys Glu Leu Leu Asp Arg Leu Lys Gln Gly Thr Tyr Gly 165 170 175 Asp Ile Tyr Asn Phe Pro Ile Asn Ala Phe Asp Lys Ala Met Glu Lys 180 185 190 Gln Asp Glu Glu Ser Glu Ser Glu Glu Glu Glu Glu Glu Glu Asp Glu 195 200 205 Glu Glu Ser Gly Thr Arg Glu Phe Ile Ala Glu Asp Glu Phe Glu Glu 210 215 220 Ser Asp Leu Ser Asp Phe Glu Glu Leu Asn Asn Leu Lys Asp Ser Ser 225 230 235 240 Asp Glu Glu Val Asn Asp Glu Glu Glu Ser Ser Glu Glu Ser Glu Glu 245 250 255 Glu Ser Glu Glu Glu Glu Val Lys Ser Lys Ala Lys Ser Lys Gly Lys 260 265 270 Ala Pro Leu Lys Gly Pro Ser Arg Lys Asn Glu Pro Met Trp Arg Ser 275 280 285 Ser Met Asn Arg Arg Gln Asn Gln Leu Arg Lys Ala Arg Pro His Arg 290 295 300 Gly Ser Val Glu Arg Phe Gln Ile Met Glu Cys Asp His Leu Pro Asp 305 310 315 320 Thr Trp Ile Gln Leu Glu Arg Thr Leu Val Cys Ser Asn Thr Pro Ile 325 330 335 Lys Ser Phe Leu Val Gly Ile Phe Phe Phe Phe Phe Phe Leu His 340 345 350 101 1247 DNA Danio rerio 101 acacacgccg gggttcatca tggcggcggg accgatttca gaacgaaatc aagatgccgc 60 tgtatatgtg ggtggtctcg atgagaaagt ctctgagcct cttttgtggg agcttttcct 120 tcaagctggt cctgttgtaa acacacacat gcccaaagac agagtaacag gacaacacca 180 gggttatggc tttgtggagt ttctcagcga agaagatgcg gactacgcca taaaaataat 240 gaatatgatc aaactttacg gcaagcccat acgagtcaac aaagcttcag cgcacaacaa 300 gaacctcgac gttggtgcca acatcttcat tggcaacttg gaccctgaga tcgacgagaa 360 attactttat gacacattca gcgctttcgg tgtgatcctt cagaccccca agatcatgcg 420 agatcctgat acgggcaact cgaaaggtta cgccttcatc aactttgcca gttttgatgc 480 gtcagatgcg gctatcgagg ctatgaatgg ccagtatctg tgtaatcggc ccatcactgt 540 ttcatacgcg tttaagaaag actcaaaggg tgaaagacac gggtcagcag ccgagcgtct 600 ccttgccgca cagaatccac tgtcccaggc cgatcgccca catcagcttt tcgcagatgc 660 tccacctcct ccaagcatgc ccacgcctgt catgactgcc ctcggagcag caatgcctat 720 tccaggaatg cctcctcccg gtgctttccc acctgttcct cctcctggca caatgccacc 780 tgggatgcct cccggtatgc ccatgcctcc agcaccaggt actcctgcgc cacagggcgg 840 tggaggaccc ccaccgggac acccgccctt ccctcctgct ggaatgcatc cgccaggaat 900 gccgcatatg ccgatgcctc caacgggacc tccaggaatg gtgcctcccc cacccggccc 960 accaggatct aaccaaccaa gagcaccgcc tccaccacaa atgcctcctc cacccatggg 1020 tgtccctccc agaggaccct ttggaccccc aatgggtcca ccaatgcatc cgggtatgag 1080 gggtcctcct ccaccaatgc cgcctccagg ttatggtgca ggaccaccca aacccccacc 1140 ctttggtttt caaaaagggc ctcctatgcc gctcgtcctc ctgacccaaa ggtccgatga 1200 aagcaccggc ccttcctaaa ccccaatgtg aaaaccaaac tttgtct 1247 102 399 PRT Danio rerio 102 Met Ala Ala Gly Pro Ile Ser Glu Arg Asn Gln Asp Ala Ala Val Tyr 1 5 10 15 Val Gly Gly Leu Asp Glu Lys Val Ser Glu Pro Leu Leu Trp Glu Leu 20 25 30 Phe Leu Gln Ala Gly Pro Val Val Asn Thr His Met Pro Lys Asp Arg 35 40 45 Val Thr Gly Gln His Gln Gly Tyr Gly Phe Val Glu Phe Leu Ser Glu 50 55 60 Glu Asp Ala Asp Tyr Ala Ile Lys Ile Met Asn Met Ile Lys Leu Tyr 65 70 75 80 Gly Lys Pro Ile Arg Val Asn Lys Ala Ser Ala His Asn Lys Asn Leu 85 90 95 Asp Val Gly Ala Asn Ile Phe Ile Gly Asn Leu Asp Pro Glu Ile Asp 100 105 110 Glu Lys Leu Leu Tyr Asp Thr Phe Ser Ala Phe Gly Val Ile Leu Gln 115 120 125 Thr Pro Lys Ile Met Arg Asp Pro Asp Thr Gly Asn Ser Lys Gly Tyr 130 135 140 Ala Phe Ile Asn Phe Ala Ser Phe Asp Ala Ser Asp Ala Ala Ile Glu 145 150 155 160 Ala Met Asn Gly Gln Tyr Leu Cys Asn Arg Pro Ile Thr Val Ser Tyr 165 170 175 Ala Phe Lys Lys Asp Ser Lys Gly Glu Arg His Gly Ser Ala Ala Glu 180 185 190 Arg Leu Leu Ala Ala Gln Asn Pro Leu Ser Gln Ala Asp Arg Pro His 195 200 205 Gln Leu Phe Ala Asp Ala Pro Pro Pro Pro Ser Met Pro Thr Pro Val 210 215 220 Met Thr Ala Leu Gly Ala Ala Met Pro Ile Pro Gly Met Pro Pro Pro 225 230 235 240 Gly Ala Phe Pro Pro Val Pro Pro Pro Gly Thr Met Pro Pro Gly Met 245 250 255 Pro Pro Gly Met Pro Met Pro Pro Ala Pro Gly Thr Pro Ala Pro Gln 260 265 270 Gly Gly Gly Gly Pro Pro Pro Gly His Pro Pro Phe Pro Pro Ala Gly 275 280 285 Met His Pro Pro Gly Met Pro His Met Pro Met Pro Pro Thr Gly Pro 290 295 300 Pro Gly Met Val Pro Pro Pro Pro Gly Pro Pro Gly Ser Asn Gln Pro 305 310 315 320 Arg Ala Pro Pro Pro Pro Gln Met Pro Pro Pro Pro Met Gly Val Pro 325 330 335 Pro Arg Gly Pro Phe Gly Pro Pro Met Gly Pro Pro Met His Pro Gly 340 345 350 Met Arg Gly Pro Pro Pro Pro Met Pro Pro Pro Gly Tyr Gly Ala Gly 355 360 365 Pro Pro Lys Pro Pro Pro Phe Gly Phe Gln Lys Gly Pro Pro Met Pro 370 375 380 Leu Val Leu Leu Thr Gln Arg Ser Asp Glu Ser Thr Gly Pro Ser 385 390 395 103 838 DNA Danio rerio 103 cggcacgagc tcgagtgtct gtgcttcagt cttcgggcgg agaattccct ctgtaatcta 60 gagtcttcct gcgcttcatc tccgggataa agatggcccc gaaggattat acggcggaga 120 aagagaaatg taagcggttc ctgcaggagt tctacactga ggatgactcc gggaagaaga 180 tcttcaagta tggtgctcag ctggtatctc tggcacatcg ggagcaggtg gcgctgctgg 240 tggatctgga cgacgtggcg gaagaagacc cggatctggt ggagagtgtg tgtgagaacg 300 ccaaacgata cacagcgctg tttgcagacg ccattcatga gctgctgccg gagtacaggg 360 agcgggaggc ggtggtgaag gacgcgctgg acgtctacat cgagcaccgg ctgatgatgg 420 aggtcagagg tcgcgaccca gcagacacac gtgaccacag aaagcagtac ccgcccgagc 480 tgatgcgcag atttgaggtg tatttccgtc ctccggcgac gctgaagccg cgggttgtgc 540 gtgatgtgaa ggcggacagc atcgggcagc tggtgacggt ccgggtaatc gtgacgcgag 600 cgactgaagt caagcccatg atggccgtcg ctacgtacac ctgcgaccag tgcggcgccg 660 agacctacca gccgttttaa taataaagct cctcctcctg ctcctcctcc tgctctgctg 720 ttctgactct gctcctgtgt gagggtccgc tgttgcctgt atatttgcgc ttgtttgttg 780 ttgtttattt tctgcaataa agatgatggc cgtcaaaaaa aaaaaaaaaa aaaaaaaa 838 104 195 PRT Danio rerio 104 Met Ala Pro Lys Asp Tyr Thr Ala Glu Lys Glu Lys Cys Lys Arg Phe 1 5 10 15 Leu Gln Glu Phe Tyr Thr Glu Asp Asp Ser Gly Lys Lys Ile Phe Lys 20 25 30 Tyr Gly Ala Gln Leu Val Ser Leu Ala His Arg Glu Gln Val Ala Leu 35 40 45 Leu Val Asp Leu Asp Asp Val Ala Glu Glu Asp Pro Asp Leu Val Glu 50 55 60 Ser Val Cys Glu Asn Ala Lys Arg Tyr Thr Ala Leu Phe Ala Asp Ala 65 70 75 80 Ile His Glu Leu Leu Pro Glu Tyr Arg Glu Arg Glu Ala Val Val Lys 85 90 95 Asp Ala Leu Asp Val Tyr Ile Glu His Arg Leu Met Met Glu Val Arg 100 105 110 Gly Arg Asp Pro Ala Asp Thr Arg Asp His Arg Lys Gln Tyr Pro Pro 115 120 125 Glu Leu Met Arg Arg Phe Glu Val Tyr Phe Arg Pro Pro Ala Thr Leu 130 135 140 Lys Pro Arg Val Val Arg Asp Val Lys Ala Asp Ser Ile Gly Gln Leu 145 150 155 160 Val Thr Val Arg Val Ile Val Thr Arg Ala Thr Glu Val Lys Pro Met 165 170 175 Met Ala Val Ala Thr Tyr Thr Cys Asp Gln Cys Gly Ala Glu Thr Tyr 180 185 190 Gln Pro Phe 195 105 1915 DNA Danio rerio 105 gtatgaattt cttgctttca agtctgttta gtttcatttt attaaggatt taagtgctgc 60 ttgatttttt aatctactgt tacacagaat acagttttta aaacaggcgt tttactttaa 120 aaagaaggac atggcaacag attcgtgggc ccaggctgtc gatgaacaag aagctgcagc 180 ggaatcgata agcaccttgc aaataagtga aaaagaggag aagccgacag ctgaagcaaa 240 tggagcgaag acagaagcga agaccgaagc aaaaacagaa gcgaatgctg ataaaacgga 300 ggaggatgat aaagatgaca aggcagcaca gtcactgctg aataaactga tacgcagtaa 360 tctagtcaac accaccaatc aagttgaagt cctccagagg gatccgagct ctccgctata 420 ctctgtcaaa tcatttgaag agctgcgact gaaacctcag ctattgcaag gagtgtatgc 480 catgggtttt aatagaccct ccaaaatcca ggagactgca ttgcccatga tgctcgctga 540 acctccacag aatctgatcg ctcagtctca gtcaagaacg ggtaaaacgg ctgccttcgt 600 gctggccatg ttaagtcacg tggacactga gaacaaatgg cctgagtgtc tgtgtgtgtg 660 ccccacatat gagctggccc tacagactgg caaggtcatc gagcagatgg ggaaacatta 720 ccctgaagtc cagctagtgt acgccatcag aggaaacaaa ttggagcggg gtgcaaaact 780 gcaggagcag atcgtgattg gcactcctgg tactgttctg gattggtgcc aaaagctgaa 840 attcatagat cctaagaaga tcaaggtgtt tgtgttggac gaggctgacg tcatgatcgc 900 cacacagggt catcaagacc agagcgttcg catccagagg atgcttccta aaacttgcca 960 gatgctgttg ttctctgcca catttgagga aacggtgtgg aatttcgcaa agagaatcgt 1020 tcccgacccc aacattatca aactcaagcg tgaagaagag actctagaca ccatcaagca 1080 gtattatgtc atctgcaaca gcaaggagga gaaattccag gctttgtgca acatctacgg 1140 agccataacc atcgcacagg ctatgatctt ctgccatacc aggaaaactg ccgggtggtt 1200 ggcaggagag ctgtccagag aaggacacca ggtggcgctg ctaagtggag agatgcaggt 1260 ggagcagaga gcagcggtca ttgaacgctt ccgagacggc aaagagaaag tcctggttac 1320 cactaacgtc tgtgccagag gtatcgatgt ggagcaagtc tcagtggtga tcaacttcga 1380 tctgccagtg gacaaggacg gcaaccctga taatgagacg tatctgcacc gaattgggcg 1440 taccgggcga ttcggcaaaa gagggctggc gatcaacatg gcggacagca agtttagcat 1500 gaacaccctc aaccgcatcc aggaccactt caataagaaa attgagaaac tggacaccga 1560 tgatttggat gagatcgaga agatcgccaa ctgagcgtcc cgcctgtgac gtttacaata 1620 ggactgtgcc cacgctacat tttaattcaa aggctggaag cagagctgtg ccagagcgga 1680 tgtttacatg tggagcagtt ttatctcaga tggtgttttt ataggtcttt tattttgttt 1740 cactctttgt acttaataaa tgctgaatgt gtagtcttaa cgtcacgtct acgagaatcc 1800 tgcgggagcc gtctgggaca ccgacaagga gaatatgcct tttcaagatt aaccgttaca 1860 tcatgcacct ttttaaaatc gcctgccttc aaaaaaaaaa aaaaaaaaaa aaaaa 1915 106 487 PRT Danio rerio 106 Met Ala Thr Asp Ser Trp Ala Gln Ala Val Asp Glu Gln Glu Ala Ala 1 5 10 15 Ala Glu Ser Ile Ser Thr Leu Gln Ile Ser Glu Lys Glu Glu Lys Pro 20 25 30 Thr Ala Glu Ala Asn Gly Ala Lys Thr Glu Ala Lys Thr Glu Ala Lys 35 40 45 Thr Glu Ala Asn Ala Asp Lys Thr Glu Glu Asp Asp Lys Asp Asp Lys 50 55 60 Ala Ala Gln Ser Leu Leu Asn Lys Leu Ile Arg Ser Asn Leu Val Asn 65 70 75 80 Thr Thr Asn Gln Val Glu Val Leu Gln Arg Asp Pro Ser Ser Pro Leu 85 90 95 Tyr Ser Val Lys Ser Phe Glu Glu Leu Arg Leu Lys Pro Gln Leu Leu 100 105 110 Gln Gly Val Tyr Ala Met Gly Phe Asn Arg Pro Ser Lys Ile Gln Glu 115 120 125 Thr Ala Leu Pro Met Met Leu Ala Glu Pro Pro Gln Asn Leu Ile Ala 130 135 140 Gln Ser Gln Ser Arg Thr Gly Lys Thr Ala Ala Phe Val Leu Ala Met 145 150 155 160 Leu Ser His Val Asp Thr Glu Asn Lys Trp Pro Glu Cys Leu Cys Val 165 170 175 Cys Pro Thr Tyr Glu Leu Ala Leu Gln Thr Gly Lys Val Ile Glu Gln 180 185 190 Met Gly Lys His Tyr Pro Glu Val Gln Leu Val Tyr Ala Ile Arg Gly 195 200 205 Asn Lys Leu Glu Arg Gly Ala Lys Leu Gln Glu Gln Ile Val Ile Gly 210 215 220 Thr Pro Gly Thr Val Leu Asp Trp Cys Gln Lys Leu Lys Phe Ile Asp 225 230 235 240 Pro Lys Lys Ile Lys Val Phe Val Leu Asp Glu Ala Asp Val Met Ile 245 250 255 Ala Thr Gln Gly His Gln Asp Gln Ser Val Arg Ile Gln Arg Met Leu 260 265 270 Pro Lys Thr Cys Gln Met Leu Leu Phe Ser Ala Thr Phe Glu Glu Thr 275 280 285 Val Trp Asn Phe Ala Lys Arg Ile Val Pro Asp Pro Asn Ile Ile Lys 290 295 300 Leu Lys Arg Glu Glu Glu Thr Leu Asp Thr Ile Lys Gln Tyr Tyr Val 305 310 315 320 Ile Cys Asn Ser Lys Glu Glu Lys Phe Gln Ala Leu Cys Asn Ile Tyr 325 330 335 Gly Ala Ile Thr Ile Ala Gln Ala Met Ile Phe Cys His Thr Arg Lys 340 345 350 Thr Ala Gly Trp Leu Ala Gly Glu Leu Ser Arg Glu Gly His Gln Val 355 360 365 Ala Leu Leu Ser Gly Glu Met Gln Val Glu Gln Arg Ala Ala Val Ile 370 375 380 Glu Arg Phe Arg Asp Gly Lys Glu Lys Val Leu Val Thr Thr Asn Val 385 390 395 400 Cys Ala Arg Gly Ile Asp Val Glu Gln Val Ser Val Val Ile Asn Phe 405 410 415 Asp Leu Pro Val Asp Lys Asp Gly Asn Pro Asp Asn Glu Thr Tyr Leu 420 425 430 His Arg Ile Gly Arg Thr Gly Arg Phe Gly Lys Arg Gly Leu Ala Ile 435 440 445 Asn Met Ala Asp Ser Lys Phe Ser Met Asn Thr Leu Asn Arg Ile Gln 450 455 460 Asp His Phe Asn Lys Lys Ile Glu Lys Leu Asp Thr Asp Asp Leu Asp 465 470 475 480 Glu Ile Glu Lys Ile Ala Asn 485 107 1316 DNA Danio rerio 107 ccacgcgtcc gcctgaacga ggcacaacgt ggttttagga gaagccatac gacagtaggt 60 tcagaacgtt tagctttatt tatttgacca ctgttagatt tcaatatgac agaaggcaaa 120 tcatcagaca agccggcgaa aagactgctg gctctgaatc cgaaagagga cgcagaattt 180 cagaagaaag tgcagcaagt gaagaagcgc cctcaaacgg gtcaaacact tagtccagga 240 gtgctgtatg ttggtcatct tcctcgagga ctgtttgagc ctcagttgaa atcttacttt 300 gagcagtttg gcaaagtctt acgattgaga gtttccagga gtaaaaagac tggtgggagc 360 aaaggctatg gatttgtgga gtttgagtgt gatgaagtgg ctaagattgt ggctgaaacc 420 atgaacaact accttatggg agaacgcatt ataaaatgtc acgttattcc acctgagaag 480 gttcatgaaa agctctttgt cggctctatt gctggcttca agaagccaaa gtatcctgca 540 gtgactcggt ataacaaaac acacacagaa gatgatgtga agaaagttgg cacaaagctt 600 ttaagcaaag agtctaagct gcgcaagaga ctggcagcaa aaggcattga ctatgatttc 660 ccaggatttg cagcacagat tccagctaaa aaggctccgt cagaagcaaa tgtttcagtg 720 tgcagtgagg atgtaactcc agtgtgcaca ccttcacttt tggagaggag aaagtctctc 780 agagttgagg atgatgatgt tgatgatgaa atagtcatca aagtcaaacc agtcccagaa 840 aacagtgatg atgtggaaga aagtgaggaa gagtctgctg aagaggatga aggtgaagag 900 gaggaagcgg cttaaaagga gaatttagtg actaataaag actttattcg ctgattcaga 960 aggggacgct ggactcgctt ttcagagcca tttgtatgcg attagttctc agttctcccc 1020 taaggctgga gaagaggacc acacatatta ctcgcacaca ttatcctgat tgtcaggaat 1080 ctgttgattt gaggcgtgtt atgcagagtt tgtgcccgtg accttttaag ggtacaatgt 1140 ggttgaccat agagttgctg aaatgactgt attaatcaaa gcagatgtat agtgttaata 1200 tgaaaagaat tattttcatt gtatttgaca cattttctgt ccacatatgt tgcattgcaa 1260 tgtttgtaaa acatgtttaa accgtaaaat aaactgcttt aataaatgta aaaaaa 1316 108 269 PRT Danio rerio 108 Met Thr Glu Gly Lys Ser Ser Asp Lys Pro Ala Lys Arg Leu Leu Ala 1 5 10 15 Leu Asn Pro Lys Glu Asp Ala Glu Phe Gln Lys Lys Val Gln Gln Val 20 25 30 Lys Lys Arg Pro Gln Thr Gly Gln Thr Leu Ser Pro Gly Val Leu Tyr 35 40 45 Val Gly His Leu Pro Arg Gly Leu Phe Glu Pro Gln Leu Lys Ser Tyr 50 55 60 Phe Glu Gln Phe Gly Lys Val Leu Arg Leu Arg Val Ser Arg Ser Lys 65 70 75 80 Lys Thr Gly Gly Ser Lys Gly Tyr Gly Phe Val Glu Phe Glu Cys Asp 85 90 95 Glu Val Ala Lys Ile Val Ala Glu Thr Met Asn Asn Tyr Leu Met Gly 100 105 110 Glu Arg Ile Ile Lys Cys His Val Ile Pro Pro Glu Lys Val His Glu 115 120 125 Lys Leu Phe Val Gly Ser Ile Ala Gly Phe Lys Lys Pro Lys Tyr Pro 130 135 140 Ala Val Thr Arg Tyr Asn Lys Thr His Thr Glu Asp Asp Val Lys Lys 145 150 155 160 Val Gly Thr Lys Leu Leu Ser Lys Glu Ser Lys Leu Arg Lys Arg Leu 165 170 175 Ala Ala Lys Gly Ile Asp Tyr Asp Phe Pro Gly Phe Ala Ala Gln Ile 180 185 190 Pro Ala Lys Lys Ala Pro Ser Glu Ala Asn Val Ser Val Cys Ser Glu 195 200 205 Asp Val Thr Pro Val Cys Thr Pro Ser Leu Leu Glu Arg Arg Lys Ser 210 215 220 Leu Arg Val Glu Asp Asp Asp Val Asp Asp Glu Ile Val Ile Lys Val 225 230 235 240 Lys Pro Val Pro Glu Asn Ser Asp Asp Val Glu Glu Ser Glu Glu Glu 245 250 255 Ser Ala Glu Glu Asp Glu Gly Glu Glu Glu Glu Ala Ala 260 265 109 1940 DNA Danio rerio 109 aattcggcac gaggtaaaaa gcaacaaggc gccgtttgac tccgcagaat cggcttttgt 60 acgcggtccg agacctgttc acgggttctg cacacaggca tggcgcgttt tctgtagcgt 120 gatttatttt tgaatgtcga ttttgtaaaa cgccaccggc acctgtttat attgtgtttt 180 tgtccggatt tcaacagctc acccctgtgg ctttctctgg gcgttttgag ctgttttgct 240 gctggaagga tgtcaacaca aggaagcgcg gttcatgact ccgatattca caatcaggaa 300 aacatgcttt cgaggctacg cggtgctgct gcaaagaaca gggttgaaaa tcgtgaaaat 360 gtcaatccta accctaaagc agctaacaac aggacagtcc tgggcgagct ggagaacagc 420 cagaggagac agctcggtct gcgaggcgct aaacaggggt ctgggccaca gataattgcg 480 tgtaaacctg aagaaaatgc caggagtttt ggagagaggc ctagcaacag acagcctgct 540 gcttttcaga ttcacgtgga tgagcctgat ggcgcgtgct ccaagaaagc acctttacag 600 agatccacca tggactgttc cccgctgaca ctaaacccca ctgtcacccg cctcaggcag 660 cccctcgcca ccattgacct tccactggag gccagttttg attctccaat ggacatgtca 720 gtaatcgatt gtgaggaacg accaacaaat gtcaatgaag tctcagatta tgcagcagaa 780 atccacacgc atttgcggga aatggaggtc aagtctaaac cgaaagcagg ttacatgaga 840 aaacagccag acatcacaaa cagcatgcgt gctattctag tggactggtt ggtggaagtg 900 ggagaagaat acaagctaca gaacgagact ctttacctgg ctgtaaacta cattgatcgc 960 tttctgtcct ccatgtctgt gctgaggggg aaactgcagc tggtgggcac ggctgctatg 1020 cttttggctt cgaagtttga ggagatttac cctccagagg tggcggagtt tgtgtacatc 1080 actgacgaca cgtacacaaa gaaacaagtg ttgcggatgg agcatctggt gctgacagtt 1140 ctctcgtttg atctggccgc tcccaccatc aatcagttcc tcacccagta tttcttacac 1200 cagcctgtga gcagcaaagt cgagagctta tcaatgtttc ttggagagct cagcttgata 1260 gattgcgatc ccttcctcaa gtatctcccg tctcaaatgg ctgctgctgc tttcatcttg 1320 gccaaccaca cactggccag tggatcatgg tcgaagtctc ttgttgattt gacgggttac 1380 tctttggagg atcttctgcc gtgtgttcag gatctccatc agacatatct cgctgcttct 1440 caacacgctc aacaggctgt cagagagaaa tacaagggct caaagtacca tgaggtgtcc 1500 ctgatcgagc ctccagagaa gctgatgctg aactaaccct ccatcaacat tgtgttcatg 1560 tcttccagaa ctctgttagg aatcgtgcgc acactttgat gctttattac cataactcca 1620 ataactgaag ccatagcctc aatttgggga aatggtggga ttttagtgat ctggacattt 1680 aacttttttt ttatttttat ttttttattt gcagtcctta aggattgtgc cctttttatg 1740 cttgattcag ccagatattt gtacattttt aggtaatcaa gctttgtttg gatatttcag 1800 tgtttgtact ctatttttat gaaagactcc agcagtggac tttctatttt ccactgttaa 1860 agtgaccttc cctttacatc caaataaaag tgctgctggc ctttaaagca caaaaaaaaa 1920 aaaaaaaaaa aaaaaaaaaa 1940 110 428 PRT Danio rerio 110 Met Ser Thr Gln Gly Ser Ala Val His Asp Ser Asp Ile His Asn Gln 1 5 10 15 Glu Asn Met Leu Ser Arg Leu Arg Gly Ala Ala Ala Lys Asn Arg Val 20 25 30 Glu Asn Arg Glu Asn Val Asn Pro Asn Pro Lys Ala Ala Asn Asn Arg 35 40 45 Thr Val Leu Gly Glu Leu Glu Asn Ser Gln Arg Arg Gln Leu Gly Leu 50 55 60 Arg Gly Ala Lys Gln Gly Ser Gly Pro Gln Ile Ile Ala Cys Lys Pro 65 70 75 80 Glu Glu Asn Ala Arg Ser Phe Gly Glu Arg Pro Ser Asn Arg Gln Pro 85 90 95 Ala Ala Phe Gln Ile His Val Asp Glu Pro Asp Gly Ala Cys Ser Lys 100 105 110 Lys Ala Pro Leu Gln Arg Ser Thr Met Asp Cys Ser Pro Leu Thr Leu 115 120 125 Asn Pro Thr Val Thr Arg Leu Arg Gln Pro Leu Ala Thr Ile Asp Leu 130 135 140 Pro Leu Glu Ala Ser Phe Asp Ser Pro Met Asp Met Ser Val Ile Asp 145 150 155 160 Cys Glu Glu Arg Pro Thr Asn Val Asn Glu Val Ser Asp Tyr Ala Ala 165 170 175 Glu Ile His Thr His Leu Arg Glu Met Glu Val Lys Ser Lys Pro Lys 180 185 190 Ala Gly Tyr Met Arg Lys Gln Pro Asp Ile Thr Asn Ser Met Arg Ala 195 200 205 Ile Leu Val Asp Trp Leu Val Glu Val Gly Glu Glu Tyr Lys Leu Gln 210 215 220 Asn Glu Thr Leu Tyr Leu Ala Val Asn Tyr Ile Asp Arg Phe Leu Ser 225 230 235 240 Ser Met Ser Val Leu Arg Gly Lys Leu Gln Leu Val Gly Thr Ala Ala 245 250 255 Met Leu Leu Ala Ser Lys Phe Glu Glu Ile Tyr Pro Pro Glu Val Ala 260 265 270 Glu Phe Val Tyr Ile Thr Asp Asp Thr Tyr Thr Lys Lys Gln Val Leu 275 280 285 Arg Met Glu His Leu Val Leu Thr Val Leu Ser Phe Asp Leu Ala Ala 290 295 300 Pro Thr Ile Asn Gln Phe Leu Thr Gln Tyr Phe Leu His Gln Pro Val 305 310 315 320 Ser Ser Lys Val Glu Ser Leu Ser Met Phe Leu Gly Glu Leu Ser Leu 325 330 335 Ile Asp Cys Asp Pro Phe Leu Lys Tyr Leu Pro Ser Gln Met Ala Ala 340 345 350 Ala Ala Phe Ile Leu Ala Asn His Thr Leu Ala Ser Gly Ser Trp Ser 355 360 365 Lys Ser Leu Val Asp Leu Thr Gly Tyr Ser Leu Glu Asp Leu Leu Pro 370 375 380 Cys Val Gln Asp Leu His Gln Thr Tyr Leu Ala Ala Ser Gln His Ala 385 390 395 400 Gln Gln Ala Val Arg Glu Lys Tyr Lys Gly Ser Lys Tyr His Glu Val 405 410 415 Ser Leu Ile Glu Pro Pro Glu Lys Leu Met Leu Asn 420 425 111 1125 DNA Danio rerio 111 tgcgcttgct caatttgggc aatattttga aactttccaa tcaaaccaat atactttttc 60 ggcttatctg acatattttt aacggaaaaa ctcggatcca tcgcgaaata acggaattat 120 ccagcctatc atgtcacaaa cagaaagcag catggagcag cagcgggaag aagagccgag 180 ccaactcgac ccggccggtc cggcagagga tgatacttca gaaaatggcc atgatgaagc 240 atcagataca gctatggata gtgaagcaca gtcgtcatca cctccctcag caaagggaaa 300 ggacctgctg ggtggatatg aggaaaaagt gcagacagac cggaccaaca gatttgagta 360 cctcttgaag cagacagaag tgtttgctca tttcattcaa ccagcagccc agaagacccc 420 aacttcacct ttgaagatga agcctggccg tcctcgtatt aaaaaagatg agaagcagaa 480 cctgctgtct gctggagata accgccatcg tcggacggag caggaggagg atgaggagct 540 gctaagtgag aacagtaaag ccaccagtgt ctgcacacgt tcaagtctcg cacagccatg 600 gagcttcaga ggagatgcaa cacactgatc acactcatcg aacgtgaaaa catggagttg 660 gaagagcgag agaaagccga gaagaagaag agaggaccgc gcacatcttc ggctcagaaa 720 cgcaagcagg acgggacccc tgatggacgt ggccgcaaga aaaagctgaa attgtgagca 780 ggcaggcatt tcacacacct cactcggcga ggagcttcag tacagcaata ctgcattgat 840 tgttacgggt cccactcatg tactgtatgg atttgcagca ctgatcctcg cctctcaagt 900 agcttggcct tcttaacaag gtgtagagtt gtaaattagg tctcttttag ttatataatg 960 taactacggc tgtgctgtcg gatgtgtttt gtatttatgg ctacttcaaa tttttttttg 1020 taccacattc catttgcttg tatcagttta atttgcagtc tttaccccct cattattagt 1080 gtcttcagta ttgtattgtc tctgtatccg ccattggaaa gtgac 1125 112 213 PRT Danio rerio 112 Met Ser Gln Thr Glu Ser Ser Met Glu Gln Gln Arg Glu Glu Glu Pro 1 5 10 15 Ser Gln Leu Asp Pro Ala Gly Pro Ala Glu Asp Asp Thr Ser Glu Asn 20 25 30 Gly His Asp Glu Ala Ser Asp Thr Ala Met Asp Ser Glu Ala Gln Ser 35 40 45 Ser Ser Pro Pro Ser Ala Lys Gly Lys Asp Leu Leu Gly Gly Tyr Glu 50 55 60 Glu Lys Val Gln Thr Asp Arg Thr Asn Arg Phe Glu Tyr Leu Leu Lys 65 70 75 80 Gln Thr Glu Val Phe Ala His Phe Ile Gln Pro Ala Ala Gln Lys Thr 85 90 95 Pro Thr Ser Pro Leu Lys Met Lys Pro Gly Arg Pro Arg Ile Lys Lys 100 105 110 Asp Glu Lys Gln Asn Leu Leu Ser Ala Gly Asp Asn Arg His Arg Arg 115 120 125 Thr Glu Gln Glu Glu Asp Glu Glu Leu Leu Ser Glu Asn Ser Lys Ala 130 135 140 Thr Ser Val Cys Thr Arg Ser Ser Leu Ala Gln Pro Trp Ser Phe Arg 145 150 155 160 Gly Asp Ala Thr His Thr Leu Ile Glu Arg Glu Asn Met Glu Leu Glu 165 170 175 Glu Arg Glu Lys Ala Glu Lys Lys Lys Arg Gly Pro Arg Thr Ser Ser 180 185 190 Ala Gln Lys Arg Lys Gln Asp Gly Thr Pro Asp Gly Arg Gly Arg Lys 195 200 205 Lys Lys Leu Lys Leu 210 113 4221 DNA Danio rerio 113 gggttcgtgc tctgagttta cgcgccattt atgagtggga tttcagtgca gggaagtaaa 60 cgagcgcaga tctaccctcg ttttacacca caaacactca atatacgcac ttttattcaa 120 aaggagagca aaaagcggag taaaatcgac gagaaagagg gaaatcagcg gaatcgaagg 180 tgccgtgaat aatgcctgta aaaacaagca agacggccgg tcggagagcg gcggaggact 240 cggatgagga ggtgagcaaa ccgcgcaaag caccgagcac tgccgtccag agcagcccgc 300 cgcccgccct taacgatggg gctgcgggag agtctgctga ggtggttgat agccgcagtc 360 tggaggagat tctgggtagc atccctcctg caccgccccc agccatgagc tcagagcccg 420 gtgccccgcg actcatgatc acgcacatag tcaacagaaa cttcaagtcc tacgctggag 480 agcagattct ggggcccttc cataagcgct tttcctgcat cattggtcca aacggcagcg 540 gcaagtccaa cgtgatcgac tccatgctgt ttgtgtttgg atatagagct cagaaaatcc 600 gctcgaagaa actgtcggtg ctgatccaca gttctgatgg tcatccagac atccagagct 660 gcactgttga ggtccacttt cagaagatca ttgataagga aggagacgac tatgatgtca 720 tccccgacag ctccttctac gttgccagaa cggctggaaa agataattcc tctgcctact 780 acattaatac caagaaggca accttcaaag atgtgggcac attactgcgc agccacggca 840 tcgatctgga ccacaatcgt ttcctcatcc tgcagggtga ggtggagcag atcgccatga 900 tgaagcccaa aggtcagacg gagcatgatg agggcatgct ggagtacctg gaggacatca 960 tcggctcctg tcgtctcaaa gagcccatca acatcctctg tcggcgggtg gaggcgctga 1020 acgagcagcg aggagagaag ctgaatcgag tgaagatggt ggagaaagag aagagcgctc 1080 tggagggaga gaagaataaa gctgtggatt ttctcaactt agaaaatgac atctttaaga 1140 agaggaacct cctctgtcac ttctatgttc atgatctgca gagccgtgtg tccgtgaaag 1200 aagctgagaa acagcagatt caggaggaca caaaagattt gagtgacaaa agcagccagc 1260 tgactgaaga aatgaagagc aaaaacgagg acctgaaggc tgtggagaaa aaattgacca 1320 agctgaccaa gtacatcgag agtcaaaagg agaagttcac ccagcttgac ctgcaggatg 1380 tggaggtgcg cgagaagctc aaacacacca agagcaaaac caagaaactg cagaaacagc 1440 ttcagaaaga ccaggagaag ttggaggagg tgcggggtgt tcctgcgagc agcgagaaga 1500 tcatcactga ggcatccgct cagaaggagg agctggagaa aaagaagcta ctagaggagc 1560 agaagttagc agaggtgatg gagagcctga aggaggagac caaaggccta caggacgata 1620 aagagaaaaa agagaaggag ctgctggagt taagtaagtc ggtgaacgag actcgctcac 1680 ggatggatgt ggcgcaatca gagctggaca tctacctgag tcagcacaac accgctataa 1740 accagctcaa ccaggccaaa agtgctctgc aggagaccgt agacacgctg agagagagga 1800 gagccgccat taaagacctc caggtcaaaa taccagctca agaggagcag ctcaagaagg 1860 acgagcggga gctggagcag atctccgagc aggacaaaca gacgcgagcg caggtcgggg 1920 acatgagaca gaaggtggct gaagccaaga gctctctgtc ctccaaccgc agccgcagca 1980 aagtgctgga caccctcatg cagcagaaac gctcagggaa gatacctggc atcttgggac 2040 gactgggtga tctgggagcc attgatgaga agtatgacgt cgccatctcc tccagctgcg 2100 gctcgctgga caatattcta gtggacacca ttgacaccgc gcagaagtgc gtcacatttc 2160 tcaaggccca gaacatcggt gtggccacat tcatcggctt agacaagatg aaagtgtggc 2220 agcagagcat gggctccatc tctaccccgg agaacatccc tcgtctgttt gatatggtgc 2280 gggtgaagga tgagagcgtt cgtcctgctt tttacttcgc cctgagggac actctggtgg 2340 ctaaagacct ggagcaggcc acgcgggtcg ccttccagaa ggacaagcgc tggagggttg 2400 tcactctgca ggggcagatc atcgaacagg ccggaacgat gactggagga ggcgggcggg 2460 tgatgaaggg catgatgggc tcctcattct gtgctgatgt cacacaggag cagctggata 2520 aaatggagag tgctttgaac aaggaggtga cgcagctgca ggactgccag aagaggaaaa 2580 accagctgga ggagaaggtg cacaaagcca gatgtgagct gagagacatg aagaacacgc 2640 tggagaagta cacagccacc attcagagcc tcacagagca ggagcttcac ctcaagcctc 2700 agatcaagga cctggaggca aacgtgatcg ccgctgctcc tgataaagcc aagcagaagc 2760 agatggagaa gagtctggag gcctacagga aagactttga ggcagcgtcc aataaagcag 2820 gaaaggtgga ggccgaggtg aagcggctgc acacgctcat cgtggacatc aacagtcaca 2880 agctgaaggc gcagcaggac aaactggacc agatcaacac acagctggac cagtgctcct 2940 cagccgtcac caaagcccag gtggccatca agactgcggg acgtaatctg aagaaatcag 3000 aggacggtgt gagtcggctg gagcaggaga tcagtgagaa tgagaagctg atggaggagc 3060 tgacggagca gctgaagaaa ctggaggagc aggctggaga aatcatgcag tccaccagca 3120 gcggagaagg ctttgcccga ggtgcaggat cagcatcgtg cagtggtgca ggagatcaag 3180 gctctacagg agcaggagca cgcgttgcag aaagagtttc tgagcgtccg gttgaaggtg 3240 gagcacatcg acacggccat cacggagtgc cacaacaaga tcaaacactg gcagaaagag 3300 gccagcaagt tgtgtcttca ccacattgac ggagtgcctg cagaagagct gcccgccctg 3360 aacccagatc agctgcaaga catcggcaat ccagacgtca tcaagaatga gatcgctctg 3420 ctggaggacc gctgcgccaa catgaagccc aacctggggg ctatcgcaga gttcaagaag 3480 aaggaggagt tgtatctgca gagagtggca gagcttgatg acatcactac acagagagac 3540 agcttcaaga ggggctgcga ggacctgcgc aaacagagac tgcacgagtt catggcggga 3600 ttcaacatca tcaccaataa actgaaggag aactatcaga tgctcacgct agggggcgac 3660 gcagagctgg agctggtgga cagtctggat cctttctctg agggcatcat gttcagtgta 3720 cgccccccaa agaagagctg gaagaagatc tataatctgt ctggaggaga gaagacgctc 3780 agctctctgg cgctggtgtt tgctctgcat cactttaagc ccacaccgct ctacttcatg 3840 gacgagatcg acgctgctct cgacttcaag aacgtctcca tcgtggcctg ctacatatat 3900 gaacaaacca agaatgccca gttcatcatc atctctctga ggaacaacat gtttgagatg 3960 gccgaccggc tcattggcat ctacaaaacg cacaacccca ctaagaacgt ggccatcaac 4020 cccaaaacca tcatcctgcg agagatccag acggcatgag cgcagactca gtaatgcacc 4080 atcacctgtt ctgctttcac atcgctgtgc tcctgtgagc cttttacatg tctcccgtta 4140 ctctgctttt taattttaat ttttaataaa atgattttta ttcgccagaa aaaaaaaaaa 4200 aaaataaaaa aaaaaaaaaa a 4221 114 1287 PRT Danio rerio 114 Met Pro Val Lys Thr Ser Lys Thr Ala Gly Arg Arg Ala Ala Glu Asp 1 5 10 15 Ser Asp Glu Glu Val Ser Lys Pro Arg Lys Ala Pro Ser Thr Ala Val 20 25 30 Gln Ser Ser Pro Pro Pro Ala Leu Asn Asp Gly Ala Ala Gly Glu Ser 35 40 45 Ala Glu Val Val Asp Ser Arg Ser Leu Glu Glu Ile Leu Gly Ser Ile 50 55 60 Pro Pro Ala Pro Pro Pro Ala Met Ser Ser Glu Pro Gly Ala Pro Arg 65 70 75 80 Leu Met Ile Thr His Ile Val Asn Arg Asn Phe Lys Ser Tyr Ala Gly 85 90 95 Glu Gln Ile Leu Gly Pro Phe His Lys Arg Phe Ser Cys Ile Ile Gly 100 105 110 Pro Asn Gly Ser Gly Lys Ser Asn Val Ile Asp Ser Met Leu Phe Val 115 120 125 Phe Gly Tyr Arg Ala Gln Lys Ile Arg Ser Lys Lys Leu Ser Val Leu 130 135 140 Ile His Ser Ser Asp Gly His Pro Asp Ile Gln Ser Cys Thr Val Glu 145 150 155 160 Val His Phe Gln Lys Ile Ile Asp Lys Glu Gly Asp Asp Tyr Asp Val 165 170 175 Ile Pro Asp Ser Ser Phe Tyr Val Ala Arg Thr Ala Gly Lys Asp Asn 180 185 190 Ser Ser Ala Tyr Tyr Ile Asn Thr Lys Lys Ala Thr Phe Lys Asp Val 195 200 205 Gly Thr Leu Leu Arg Ser His Gly Ile Asp Leu Asp His Asn Arg Phe 210 215 220 Leu Ile Leu Gln Gly Glu Val Glu Gln Ile Ala Met Met Lys Pro Lys 225 230 235 240 Gly Gln Thr Glu His Asp Glu Gly Met Leu Glu Tyr Leu Glu Asp Ile 245 250 255 Ile Gly Ser Cys Arg Leu Lys Glu Pro Ile Asn Ile Leu Cys Arg Arg 260 265 270 Val Glu Ala Leu Asn Glu Gln Arg Gly Glu Lys Leu Asn Arg Val Lys 275 280 285 Met Val Glu Lys Glu Lys Ser Ala Leu Glu Gly Glu Lys Asn Lys Ala 290 295 300 Val Asp Phe Leu Asn Leu Glu Asn Asp Ile Phe Lys Lys Arg Asn Leu 305 310 315 320 Leu Cys His Phe Tyr Val His Asp Leu Gln Ser Arg Val Ser Val Lys 325 330 335 Glu Ala Glu Lys Gln Gln Ile Gln Glu Asp Thr Lys Asp Leu Ser Asp 340 345 350 Lys Ser Ser Gln Leu Thr Glu Glu Met Lys Ser Lys Asn Glu Asp Leu 355 360 365 Lys Ala Val Glu Lys Lys Leu Thr Lys Leu Thr Lys Tyr Ile Glu Ser 370 375 380 Gln Lys Glu Lys Phe Thr Gln Leu Asp Leu Gln Asp Val Glu Val Arg 385 390 395 400 Glu Lys Leu Lys His Thr Lys Ser Lys Thr Lys Lys Leu Gln Lys Gln 405 410 415 Leu Gln Lys Asp Gln Glu Lys Leu Glu Glu Val Arg Gly Val Pro Ala 420 425 430 Ser Ser Glu Lys Ile Ile Thr Glu Ala Ser Ala Gln Lys Glu Glu Leu 435 440 445 Glu Lys Lys Lys Leu Leu Glu Glu Gln Lys Leu Ala Glu Val Met Glu 450 455 460 Ser Leu Lys Glu Glu Thr Lys Gly Leu Gln Asp Asp Lys Glu Lys Lys 465 470 475 480 Glu Lys Glu Leu Leu Glu Leu Ser Lys Ser Val Asn Glu Thr Arg Ser 485 490 495 Arg Met Asp Val Ala Gln Ser Glu Leu Asp Ile Tyr Leu Ser Gln His 500 505 510 Asn Thr Ala Ile Asn Gln Leu Asn Gln Ala Lys Ser Ala Leu Gln Glu 515 520 525 Thr Val Asp Thr Leu Arg Glu Arg Arg Ala Ala Ile Lys Asp Leu Gln 530 535 540 Val Lys Ile Pro Ala Gln Glu Glu Gln Leu Lys Lys Asp Glu Arg Glu 545 550 555 560 Leu Glu Gln Ile Ser Glu Gln Asp Lys Gln Thr Arg Ala Gln Val Gly 565 570 575 Asp Met Arg Gln Lys Val Ala Glu Ala Lys Ser Ser Leu Ser Ser Asn 580 585 590 Arg Ser Arg Ser Lys Val Leu Asp Thr Leu Met Gln Gln Lys Arg Ser 595 600 605 Gly Lys Ile Pro Gly Ile Leu Gly Arg Leu Gly Asp Leu Gly Ala Ile 610 615 620 Asp Glu Lys Tyr Asp Val Ala Ile Ser Ser Ser Cys Gly Ser Leu Asp 625 630 635 640 Asn Ile Leu Val Asp Thr Ile Asp Thr Ala Gln Lys Cys Val Thr Phe 645 650 655 Leu Lys Ala Gln Asn Ile Gly Val Ala Thr Phe Ile Gly Leu Asp Lys 660 665 670 Met Lys Val Trp Gln Gln Ser Met Gly Ser Ile Ser Thr Pro Glu Asn 675 680 685 Ile Pro Arg Leu Phe Asp Met Val Arg Val Lys Asp Glu Ser Val Arg 690 695 700 Pro Ala Phe Tyr Phe Ala Leu Arg Asp Thr Leu Val Ala Lys Asp Leu 705 710 715 720 Glu Gln Ala Thr Arg Val Ala Phe Gln Lys Asp Lys Arg Trp Arg Val 725 730 735 Val Thr Leu Gln Gly Gln Ile Ile Glu Gln Ala Gly Thr Met Thr Gly 740 745 750 Gly Gly Gly Arg Val Met Lys Gly Met Met Gly Ser Ser Phe Cys Ala 755 760 765 Asp Val Thr Gln Glu Gln Leu Asp Lys Met Glu Ser Ala Leu Asn Lys 770 775 780 Glu Val Thr Gln Leu Gln Asp Cys Gln Lys Arg Lys Asn Gln Leu Glu 785 790 795 800 Glu Lys Val His Lys Ala Arg Cys Glu Leu Arg Asp Met Lys Asn Thr 805 810 815 Leu Glu Lys Tyr Thr Ala Thr Ile Gln Ser Leu Thr Glu Gln Glu Leu 820 825 830 His Leu Lys Pro Gln Ile Lys Asp Leu Glu Ala Asn Val Ile Ala Ala 835 840 845 Ala Pro Asp Lys Ala Lys Gln Lys Gln Met Glu Lys Ser Leu Glu Ala 850 855 860 Tyr Arg Lys Asp Phe Glu Ala Ala Ser Asn Lys Ala Gly Lys Val Glu 865 870 875 880 Ala Glu Val Lys Arg Leu His Thr Leu Ile Val Asp Ile Asn Ser His 885 890 895 Lys Leu Lys Ala Gln Gln Asp Lys Leu Asp Gln Ile Asn Thr Gln Leu 900 905 910 Asp Gln Cys Ser Ser Ala Val Thr Lys Ala Gln Val Ala Ile Lys Thr 915 920 925 Ala Gly Arg Asn Leu Lys Lys Ser Glu Asp Gly Val Ser Arg Leu Glu 930 935 940 Gln Glu Ile Ser Glu Asn Glu Lys Leu Met Glu Glu Leu Thr Glu Gln 945 950 955 960 Leu Lys Lys Leu Glu Glu Gln Ala Gly Glu Ile Met Gln Ser Thr Ser 965 970 975 Ser Gly Glu Gly Phe Ala Arg Gly Ala Gly Ser Ala Ser Cys Ser Gly 980 985 990 Ala Gly Asp Gln Gly Ser Thr Gly Ala Gly Ala Arg Val Ala Glu Arg 995 1000 1005 Val Ser Glu Arg Pro Val Glu Gly Gly Ala His Arg His Gly His His 1010 1015 1020 Gly Val Pro Gln Gln Asp Gln Thr Leu Ala Glu Arg Gly Gln Gln Val 1025 1030 1035 1040 Val Ser Ser Pro His Gly Val Pro Ala Glu Glu Leu Pro Ala Leu Asn 1045 1050 1055 Pro Asp Gln Leu Gln Asp Ile Gly Asn Pro Asp Val Ile Lys Asn Glu 1060 1065 1070 Ile Ala Leu Leu Glu Asp Arg Cys Ala Asn Met Lys Pro Asn Leu Gly 1075 1080 1085 Ala Ile Ala Glu Phe Lys Lys Lys Glu Glu Leu Tyr Leu Gln Arg Val 1090 1095 1100 Ala Glu Leu Asp Asp Ile Thr Thr Gln Arg Asp Ser Phe Lys Arg Gly 1105 1110 1115 1120 Cys Glu Asp Leu Arg Lys Gln Arg Leu His Glu Phe Met Ala Gly Phe 1125 1130 1135 Asn Ile Ile Thr Asn Lys Leu Lys Glu Asn Tyr Gln Met Leu Thr Leu 1140 1145 1150 Gly Gly Asp Ala Glu Leu Glu Leu Val Asp Ser Leu Asp Pro Phe Ser 1155 1160 1165 Glu Gly Ile Met Phe Ser Val Arg Pro Pro Lys Lys Ser Trp Lys Lys 1170 1175 1180 Ile Tyr Asn Leu Ser Gly Gly Glu Lys Thr Leu Ser Ser Leu Ala Leu 1185 1190 1195 1200 Val Phe Ala Leu His His Phe Lys Pro Thr Pro Leu Tyr Phe Met Asp 1205 1210 1215 Glu Ile Asp Ala Ala Leu Asp Phe Lys Asn Val Ser Ile Val Ala Cys 1220 1225 1230 Tyr Ile Tyr Glu Gln Thr Lys Asn Ala Gln Phe Ile Ile Ile Ser Leu 1235 1240 1245 Arg Asn Asn Met Phe Glu Met Ala Asp Arg Leu Ile Gly Ile Tyr Lys 1250 1255 1260 Thr His Asn Pro Thr Lys Asn Val Ala Ile Asn Pro Lys Thr Ile Ile 1265 1270 1275 1280 Leu Arg Glu Ile Gln Thr Ala 1285 115 2931 DNA Danio rerio 115 gtctacagtc gagcttctgt tcaaggcaga cgtgagaaaa tggcggattc ctcagagtcg 60 ttcaacatgg ccaccagccc cactcgaggc tctcggcggg gggatctgac ctccagccct 120 ggcagagacc ttcctccatt tgaggatgag tctgagggac ttctgggaga cactcttcct 180 gatgaggagg acgatgatgg agaggagctg attggagacg ccatggagag agactaccgt 240 gtagtcccag agctggaccg ctatgaagcg gagggtctgg atgaggatga agatctgagt 300 gaactttcac ccagcgctcg tgccgaggca gaagcggcga tgaggagacg cgacagagaa 360 caaggccttg gcatgggccg catcggacgt ggactccttt acgacagtga agacgaggat 420 gacaaacggc ctacgaagag gcagcgtgtc ctggcagaga gagcagccga aggtggagcg 480 atggagggcg aggatgaaga gatgatcgaa agcattgaga acctgggaca tgaagggcca 540 cacggtgcgc gagtgggttt ctatggccgc accacgactg gagatctacc accgcttaag 600 aacttcctgc gcacgcacgt ggatgagcac ggacataatg tgtttaaaga gcgcatcagc 660 gacatgtgca aagaaaataa ggagagtttg ctggtgaact atgaagactg gcttccaaga 720 gcatgtgttg gcgtattttc taccaaagct cctgccgaaa tgctgaagat ctttgacgaa 780 gcagcaaaag aagttgtact tgccatgtac cccaaatacg accgaattgc acatgaaatc 840 cacgtcagaa tcggtaatct accattggga tcactcaggc aactccagct gatccccact 900 agtggtgtcg tgaccaactg cactggagtt cttcctcaac ttgggatggt caaatacaac 960 tgtaacaagt gtaactttat tttggggccg gtctttcagt ctcaaaacca ggaggtgaag 1020 ccgggctcct gtcctgaatg tcagtcgctc gggcccttcg aaatcaacat ggagcagacg 1080 gtgtaccaga actaccagcg catcaccatt caggagagtc ctgggaaagt ggctgctggt 1140 cgcctgcctc gatctaaaga cgccatcctg cttgcagacc tggtggacat gtgtaaaccc 1200 ggagatgaga tcgagctcac tggaatctac cacaacaact atgacggctc tctaaacatg 1260 gccaatggtt tccccgtctt cgccaccgtg atcttagcca atcacatcgc tcgaaaggac 1320 gagggcgtgg ctgtggcgga gctcaccgat gaagacgtca aagccatcgt tgcactgtcc 1380 aaagacgagc gcattggcga aaggattttc gcaagtattg gacccttcat ctacggacac 1440 gaggacatta aacgtggtct ggcacttgct ctgtttggtg gcgaagctaa aaatccaggt 1500 ggaaagcaca aggtgcgtgg tgatataaac gtcctcctgt gtggagatcc cggcacggct 1560 aaatctcagt tcctgaagta cgtggagaag gtggcgagtc gagcggtgtt caccacagga 1620 cagggagcct ccgctgtggg tctgacggct tacgtgcagc ggcatcccgt cagccgagag 1680 tggacgctgg aggccggagc gttagtgctg gcggacagag gagtgtgtct tattgatgag 1740 ttcgataaga tgaacgatca ggacagaacc agcattcacg aggcgatgga gcagcagagc 1800 atctccattt ccaaagcagg aatcgtgact tctctccagg cccgctgcac cgtcatcgct 1860 gccgccaatc ccatcggtgg acgctacgac ccgtctctaa cgttttcgga aaacgtggac 1920 ctgacggagc ccatcatttc tcgatttgat gttctgtgtg ttgtaagaga cactgtcgat 1980 cctgtgcagg atgaaatgtt ggctcgcttt gtggtcggca gccacatcaa gcatcaccct 2040 agcaacaaag aagggggcgt ggccggactg gaggaggtgg tgcttcccaa cacattcgac 2100 gtcccaccaa tcccgcagga gttgctccgc aagtacatca tttacgctaa agagcgcgtg 2160 aggcccaaac tcaaccagat ggaccaggac aaagtggccc gcatttacag cgacctacga 2220 aaagagtcta tggctacagg aagtatcccg attactgtgc gtcacatcga gtctatgatc 2280 cgtatggccg aggctcacgc tcgcatgcat ctgcgcgact acgtgctgga ggacgacgtc 2340 aacatggcca tccgcgtcat gctagagagc ttcatcgaca cgcagaagtt cagtgtgatg 2400 aggagcatga gaaagacgtt cgcacggtat ctggccttca gacgagacaa caacgagctg 2460 ctactgttca tccttaagca gctggtttct gagcaggtct catatcagcg caaccgttac 2520 ggagcccagc aggacaccat cgagatcgca gagaaagacc tggtggacaa ggctcgacag 2580 atcaacatcc acagcctgtc agcgttttac gacagcgatc tcttccgctc aaacaaattc 2640 tcccacgatg tcaagaagaa gctcatcgtt cagcagttct agcctgcatc tggaatctcc 2700 cggacttgta tataatttcg tcacgtggtc gttggaccag tcgttcgtta cacatggatt 2760 ccacattttg acccattcat gaacttgcta ccatatagat ctgcctggct ctcgtggttt 2820 tgttgactgt atcttgcatt gtttctgctg tcaaattaat tgaatgttgt aaaaaaaaat 2880 atgagagaaa tttttgtact gtttgttcag agtaataaac tgaagctgtg t 2931 116 880 PRT Danio rerio 116 Met Ala Asp Ser Ser Glu Ser Phe Asn Met Ala Thr Ser Pro Thr Arg 1 5 10 15 Gly Ser Arg Arg Gly Asp Leu Thr Ser Ser Pro Gly Arg Asp Leu Pro 20 25 30 Pro Phe Glu Asp Glu Ser Glu Gly Leu Leu Gly Asp Thr Leu Pro Asp 35 40 45 Glu Glu Asp Asp Asp Gly Glu Glu Leu Ile Gly Asp Ala Met Glu Arg 50 55 60 Asp Tyr Arg Val Val Pro Glu Leu Asp Arg Tyr Glu Ala Glu Gly Leu 65 70 75 80 Asp Glu Asp Glu Asp Leu Ser Glu Leu Ser Pro Ser Ala Arg Ala Glu 85 90 95 Ala Glu Ala Ala Met Arg Arg Arg Asp Arg Glu Gln Gly Leu Gly Met 100 105 110 Gly Arg Ile Gly Arg Gly Leu Leu Tyr Asp Ser Glu Asp Glu Asp Asp 115 120 125 Lys Arg Pro Thr Lys Arg Gln Arg Val Leu Ala Glu Arg Ala Ala Glu 130 135 140 Gly Gly Ala Met Glu Gly Glu Asp Glu Glu Met Ile Glu Ser Ile Glu 145 150 155 160 Asn Leu Gly His Glu Gly Pro His Gly Ala Arg Val Gly Phe Tyr Gly 165 170 175 Arg Thr Thr Thr Gly Asp Leu Pro Pro Leu Lys Asn Phe Leu Arg Thr 180 185 190 His Val Asp Glu His Gly His Asn Val Phe Lys Glu Arg Ile Ser Asp 195 200 205 Met Cys Lys Glu Asn Lys Glu Ser Leu Leu Val Asn Tyr Glu Asp Trp 210 215 220 Leu Pro Arg Ala Cys Val Gly Val Phe Ser Thr Lys Ala Pro Ala Glu 225 230 235 240 Met Leu Lys Ile Phe Asp Glu Ala Ala Lys Glu Val Val Leu Ala Met 245 250 255 Tyr Pro Lys Tyr Asp Arg Ile Ala His Glu Ile His Val Arg Ile Gly 260 265 270 Asn Leu Pro Leu Gly Ser Leu Arg Gln Leu Gln Leu Ile Pro Thr Ser 275 280 285 Gly Val Val Thr Asn Cys Thr Gly Val Leu Pro Gln Leu Gly Met Val 290 295 300 Lys Tyr Asn Cys Asn Lys Cys Asn Phe Ile Leu Gly Pro Val Phe Gln 305 310 315 320 Ser Gln Asn Gln Glu Val Lys Pro Gly Ser Cys Pro Glu Cys Gln Ser 325 330 335 Leu Gly Pro Phe Glu Ile Asn Met Glu Gln Thr Val Tyr Gln Asn Tyr 340 345 350 Gln Arg Ile Thr Ile Gln Glu Ser Pro Gly Lys Val Ala Ala Gly Arg 355 360 365 Leu Pro Arg Ser Lys Asp Ala Ile Leu Leu Ala Asp Leu Val Asp Met 370 375 380 Cys Lys Pro Gly Asp Glu Ile Glu Leu Thr Gly Ile Tyr His Asn Asn 385 390 395 400 Tyr Asp Gly Ser Leu Asn Met Ala Asn Gly Phe Pro Val Phe Ala Thr 405 410 415 Val Ile Leu Ala Asn His Ile Ala Arg Lys Asp Glu Gly Val Ala Val 420 425 430 Ala Glu Leu Thr Asp Glu Asp Val Lys Ala Ile Val Ala Leu Ser Lys 435 440 445 Asp Glu Arg Ile Gly Glu Arg Ile Phe Ala Ser Ile Gly Pro Phe Ile 450 455 460 Tyr Gly His Glu Asp Ile Lys Arg Gly Leu Ala Leu Ala Leu Phe Gly 465 470 475 480 Gly Glu Ala Lys Asn Pro Gly Gly Lys His Lys Val Arg Gly Asp Ile 485 490 495 Asn Val Leu Leu Cys Gly Asp Pro Gly Thr Ala Lys Ser Gln Phe Leu 500 505 510 Lys Tyr Val Glu Lys Val Ala Ser Arg Ala Val Phe Thr Thr Gly Gln 515 520 525 Gly Ala Ser Ala Val Gly Leu Thr Ala Tyr Val Gln Arg His Pro Val 530 535 540 Ser Arg Glu Trp Thr Leu Glu Ala Gly Ala Leu Val Leu Ala Asp Arg 545 550 555 560 Gly Val Cys Leu Ile Asp Glu Phe Asp Lys Met Asn Asp Gln Asp Arg 565 570 575 Thr Ser Ile His Glu Ala Met Glu Gln Gln Ser Ile Ser Ile Ser Lys 580 585 590 Ala Gly Ile Val Thr Ser Leu Gln Ala Arg Cys Thr Val Ile Ala Ala 595 600 605 Ala Asn Pro Ile Gly Gly Arg Tyr Asp Pro Ser Leu Thr Phe Ser Glu 610 615 620 Asn Val Asp Leu Thr Glu Pro Ile Ile Ser Arg Phe Asp Val Leu Cys 625 630 635 640 Val Val Arg Asp Thr Val Asp Pro Val Gln Asp Glu Met Leu Ala Arg 645 650 655 Phe Val Val Gly Ser His Ile Lys His His Pro Ser Asn Lys Glu Gly 660 665 670 Gly Val Ala Gly Leu Glu Glu Val Val Leu Pro Asn Thr Phe Asp Val 675 680 685 Pro Pro Ile Pro Gln Glu Leu Leu Arg Lys Tyr Ile Ile Tyr Ala Lys 690 695 700 Glu Arg Val Arg Pro Lys Leu Asn Gln Met Asp Gln Asp Lys Val Ala 705 710 715 720 Arg Ile Tyr Ser Asp Leu Arg Lys Glu Ser Met Ala Thr Gly Ser Ile 725 730 735 Pro Ile Thr Val Arg His Ile Glu Ser Met Ile Arg Met Ala Glu Ala 740 745 750 His Ala Arg Met His Leu Arg Asp Tyr Val Leu Glu Asp Asp Val Asn 755 760 765 Met Ala Ile Arg Val Met Leu Glu Ser Phe Ile Asp Thr Gln Lys Phe 770 775 780 Ser Val Met Arg Ser Met Arg Lys Thr Phe Ala Arg Tyr Leu Ala Phe 785 790 795 800 Arg Arg Asp Asn Asn Glu Leu Leu Leu Phe Ile Leu Lys Gln Leu Val 805 810 815 Ser Glu Gln Val Ser Tyr Gln Arg Asn Arg Tyr Gly Ala Gln Gln Asp 820 825 830 Thr Ile Glu Ile Ala Glu Lys Asp Leu Val Asp Lys Ala Arg Gln Ile 835 840 845 Asn Ile His Ser Leu Ser Ala Phe Tyr Asp Ser Asp Leu Phe Arg Ser 850 855 860 Asn Lys Phe Ser His Asp Val Lys Lys Lys Leu Ile Val Gln Gln Phe 865 870 875 880 117 623 DNA Danio rerio 117 aaacctcaac gacctgcgac gacgaaatga aaagcgagca gcaaagctgt tgaataacgc 60 ttttgaagag ctcctggcat tccagcgagc cctgaaagat ctggtggctt caatagatgc 120 cacttacgcc aagcaatgtg aggagttctt cattggtttg gagggcagct ttgggagcaa 180 acatgtctcc cctcgaaccc tgacctctcg cctgctgggc agcatggtgt gtttggaggg 240 catcgtcact aaatgctctt tggtgcgtcc caaagtagta cgcagtgtgc actactgtcc 300 agccactaaa aagaccatgg agcgcaaata caccgatctg acttctctgg atgccttccc 360 ttcaagtgct atttacccca ccaaggatga agagaacaat ccattggaga cagagtttgg 420 tctgtccgtc tataaggacc accaaaccat cactatacag gagatgccgg agaaggcccc 480 tgctggtcag ctgccccgct ctgtggacat catcctggac aatgacctgg tggacgcccg 540 tgaaacctgg agaccgaaca caggtgatcg gcacctaccg ctgcctgccg ggcaaaaaag 600 gtggcttacc tcgggcacct tca 623 118 178 PRT Danio rerio 118 Asn Leu Asn Asp Leu Arg Arg Arg Asn Glu Lys Arg Ala Ala Lys Leu 1 5 10 15 Leu Asn Asn Ala Phe Glu Glu Leu Leu Ala Phe Gln Arg Ala Leu Lys 20 25 30 Asp Leu Val Ala Ser Ile Asp Ala Thr Tyr Ala Lys Gln Cys Glu Glu 35 40 45 Phe Phe Ile Gly Leu Glu Gly Ser Phe Gly Ser Lys His Val Ser Pro 50 55 60 Arg Thr Leu Thr Ser Arg Leu Leu Gly Ser Met Val Cys Leu Glu Gly 65 70 75 80 Ile Val Thr Lys Cys Ser Leu Val Arg Pro Lys Val Val Arg Ser Val 85 90 95 His Tyr Cys Pro Ala Thr Lys Lys Thr Met Glu Arg Lys Tyr Thr Asp 100 105 110 Leu Thr Ser Leu Asp Ala Phe Pro Ser Ser Ala Ile Tyr Pro Thr Lys 115 120 125 Asp Glu Glu Asn Asn Pro Leu Glu Thr Glu Phe Gly Leu Ser Val Tyr 130 135 140 Lys Asp His Gln Thr Ile Thr Ile Gln Glu Met Pro Glu Lys Ala Pro 145 150 155 160 Ala Gly Gln Leu Pro Arg Ser Val Asp Ile Ile Leu Asp Asn Asp Leu 165 170 175 Val Asp 119 2915 DNA Danio rerio 119 aggctgacgt ttccgtactg ctgcctacag atggacactc tctacatctc ttcccaccct 60 gatgacttca ggagtttgtt gcccttgatc gctgctggat tctgtccgtc tcctccacga 120 attctccacc aggaccctcc agcagaggtg gccagatctc ggcctgcgct ggttttagca 180 ggtgcggggg gcactgtgct cagcggcacc agtgctgtga gttggtatct ggcagcaaca 240 ggcaaacgag ccggaagtga taaaaagcag gagagtcagg tgtggcagtg gctgagtttt 300 gctgagaatg agctgacgcc ggtggcctgc gctgtggcct tccctctgct tggaatcatg 360 ggagttgata agaagcttca gcagagttcc agagcggagc tgctgcgtgt tctgaaagct 420 ctggatggga ctcttgcgct tcggacattc ttggtgggag agagtgttac tctggcggat 480 gcacctgttg cgatggcggc tttgctgcca ttcaaatata cactggaccc tgcaaacagg 540 aagtcgttgg tgaatgtgac caggtggttt aatacctgtg taaatcagcc acagtttctg 600 aaggtattgg gtaaaatctc actatgtgag aaaatggtgc ctgtgacccc caaacccaac 660 actgcttcca atgtcacagt aactgctgct gctgctgcta aaccagacag cggacctgct 720 aatggtcctc ctaagacaga agctcaactg aagaaagagg ccaagaagcg agaaaagatg 780 gagaagtttc aacagaagaa agacatggag gagaagaaga aaatgcagcc tcagactgag 840 aaaaaggcca aaccagagaa gaaggaactg ggtgtgatca cctacaatgt ctccacacca 900 cctggcgaga aaaaagatgt cctgagtcct ctgcctgatt catacagtcc tcagtatgtg 960 gaagcggcct ggtactcctg gtgggagaaa caaggatttt tcaagcctga atctgagagg 1020 aagaaactga gtgagccgaa tccgcgtgga gtgtttatga tgtgtatccc accacccaat 1080 gtgactggat cccttcacct tggtcatgcc ttgaccaatg ctattcagga ctgcttgacg 1140 agatggcaca gaatgcgaga agagacaaca ctgtgggatc ccggctgtga tcacgctggc 1200 atcgccacac aggtggtggt ggagaagaaa ctcgtgagag agcggaagat gaccagacac 1260 gaccttgggc gagagaactt catcaaggag gtctggaagt ggaagaagcc agagaaaaag 1320 ctgccaaacg cttcaatgtc actgcagaca aaatctctct gcgccaggat gaggacgtgt 1380 tggacacatg gttctcctcg gggatcttcc ccttctccat ctttggctgg ccaaatgagt 1440 ctgaggatct gagagtcttc tacccgggca cactgttaga gacgggtcat gatatcttgt 1500 tcttctgggt tgctcgaatg gtcatgatgg gcctcaaact cactggaaaa ctacctttca 1560 aagaggtgta cctgcatgca gtggtgcgag atgctcatgg caggaaaatg agcaagtctc 1620 ttggtaacgt catcgaccct ctggacgtca tcaccggcat atcactggag ggtctttatg 1680 ctcagctagc agacagtaat ctggatcctc tggagattga gaaagccaaa cagggacaga 1740 aatccgactt atcccactgg aattccagaa tgtgggactg atgcgcttag atttgctctg 1800 tgtgcatacc ccagtcaagg tcgagacatt aatctggatg ttaacaggat tttgggatac 1860 aggcatttct gtaacaagct gtggaacgct gtgaagttcg ccatgaggac tctgggagag 1920 gggtttgtgc cttgtgagaa agctcagctg tgtggaagtg aaagtgtctc tgatcgctgg 1980 attctgtctc gcctgagcgc agcagtagcg ttgtgtgatg gcggatttaa ggcctacgac 2040 tttcccacaa tcaccactgc catctacaat ttctggctgt atgagctctg tgacgtctat 2100 ttggagtgtg tgaagccggt gtttagcagg acagactctg agggtcagaa acaggctgac 2160 gtctgcagac agaccctcta cacctgtctg gaggtgggac tgcgcctcct gtcccccatc 2220 atgccttttg tttctgaaga gctcttccag cgtttgccaa ggagaaggcc tcgggacagc 2280 cttccagcat ctctgtcaca cctatcctga cactgctgag ttctgctggc acagtgagga 2340 tgtggaccgt cagatggagt ttgtcatgtc tgtggtcaga accattcgct cactcagggc 2400 tgattacaac ctgactaaga ctcgcgctga ctgtttcctg cagtgtatcg actccgaaac 2460 ggccgctctg gttcagaaat acagtcttca gatccagacc ctgtcctact ctcaggccgt 2520 tcactctgtg gtcggagatg ccgctatccc acaaggctgt gcggtggcca tcgcttcaga 2580 taaatgcact gtcaacctca tgctgaaggg tttgattgac ttgggaaagg aggtcacgaa 2640 actgacggcc aagaaaggcg agctggagaa acagatggag aagatgaagg aaaagatgac 2700 aaagaacgac tacaaggaga aagttcctgt gaaagtgcag gaggctgatg ctgagaagct 2760 gaggcagagt gaagcagagc tgcagaaggt gaaggaagct atagagaact tcagcaagat 2820 gatgtaacat ttcaccctca tgcttttttt ttgtatcgtt tatgaatttt tccattgaaa 2880 ttcgttcctt tgtgttcaat gtcagtcatg cctag 2915 120 957 PRT Danio rerio 120 Met Asp Thr Leu Tyr Ile Ser Ser His Pro Asp Asp Phe Arg Ser Leu 1 5 10 15 Leu Pro Leu Ile Ala Ala Gly Phe Cys Pro Ser Pro Pro Arg Ile Leu 20 25 30 His Gln Asp Pro Pro Ala Glu Val Ala Arg Ser Arg Pro Ala Leu Val 35 40 45 Leu Ala Gly Ala Gly Gly Thr Val Leu Ser Gly Thr Ser Ala Val Ser 50 55 60 Trp Tyr Leu Ala Ala Thr Gly Lys Arg Ala Gly Ser Asp Lys Lys Gln 65 70 75 80 Glu Ser Gln Val Trp Gln Trp Leu Ser Phe Ala Glu Asn Glu Leu Thr 85 90 95 Pro Val Ala Cys Ala Val Ala Phe Pro Leu Leu Gly Ile Met Gly Val 100 105 110 Asp Lys Lys Leu Gln Gln Ser Ser Arg Ala Glu Leu Leu Arg Val Leu 115 120 125 Lys Ala Leu Asp Gly Thr Leu Ala Leu Arg Thr Phe Leu Val Gly Glu 130 135 140 Ser Val Thr Leu Ala Asp Ala Pro Val Ala Met Ala Ala Leu Leu Pro 145 150 155 160 Phe Lys Tyr Thr Leu Asp Pro Ala Asn Arg Lys Ser Leu Val Asn Val 165 170 175 Thr Arg Trp Phe Asn Thr Cys Val Asn Gln Pro Gln Phe Leu Lys Val 180 185 190 Leu Gly Lys Ile Ser Leu Cys Glu Lys Met Val Pro Val Thr Pro Lys 195 200 205 Pro Asn Thr Ala Ser Asn Val Thr Val Thr Ala Ala Ala Ala Ala Lys 210 215 220 Pro Asp Ser Gly Pro Ala Asn Gly Pro Pro Lys Thr Glu Ala Gln Leu 225 230 235 240 Lys Lys Glu Ala Lys Lys Arg Glu Lys Met Glu Lys Phe Gln Gln Lys 245 250 255 Lys Asp Met Glu Glu Lys Lys Lys Met Gln Pro Gln Thr Glu Lys Lys 260 265 270 Ala Lys Pro Glu Lys Lys Glu Leu Gly Val Ile Thr Tyr Asn Val Ser 275 280 285 Thr Pro Pro Gly Glu Lys Lys Asp Val Leu Ser Pro Leu Pro Asp Ser 290 295 300 Tyr Ser Pro Gln Tyr Val Glu Ala Ala Trp Tyr Ser Trp Trp Glu Lys 305 310 315 320 Gln Gly Phe Phe Lys Pro Glu Ser Glu Arg Lys Lys Leu Ser Glu Pro 325 330 335 Asn Pro Arg Gly Val Phe Met Met Cys Ile Pro Pro Pro Asn Val Thr 340 345 350 Gly Ser Leu His Leu Gly His Ala Leu Thr Asn Ala Ile Gln Asp Cys 355 360 365 Leu Thr Arg Trp His Arg Met Arg Glu Glu Thr Thr Leu Trp Asp Pro 370 375 380 Gly Cys Asp His Ala Gly Ile Ala Thr Gln Val Val Val Glu Lys Lys 385 390 395 400 Leu Val Arg Glu Arg Lys Met Thr Arg His Asp Leu Gly Arg Glu Asn 405 410 415 Phe Ile Lys Glu Val Trp Lys Trp Lys Lys Pro Glu Lys Lys Leu Pro 420 425 430 Asn Ala Ser Met Ser Leu Gln Thr Lys Ser Leu Cys Ala Arg Met Arg 435 440 445 Thr Cys Trp Thr His Gly Ser Pro Arg Gly Ser Ser Pro Ser Pro Ser 450 455 460 Leu Ala Gly Gln Met Ser Leu Arg Ile Val Phe Tyr Pro Gly Thr Leu 465 470 475 480 Leu Glu Thr Gly His Asp Ile Leu Phe Phe Trp Val Ala Arg Met Val 485 490 495 Met Met Gly Leu Lys Leu Thr Gly Lys Leu Pro Phe Lys Glu Val Tyr 500 505 510 Leu His Ala Val Val Arg Asp Ala His Gly Arg Lys Met Ser Lys Ser 515 520 525 Leu Gly Asn Val Ile Asp Pro Leu Asp Val Ile Thr Gly Ile Ser Leu 530 535 540 Glu Gly Leu Tyr Ala Gln Leu Ala Asp Ser Asn Leu Asp Pro Leu Glu 545 550 555 560 Ile Glu Lys Ala Lys Gln Gly Gln Lys Ser Asp Leu Ser His Trp Asn 565 570 575 Ser Arg Met Trp Asp Glx Cys Ala Cys Ala Tyr Pro Ser Gln Gly Arg 580 585 590 Asp Ile Asn Leu Asp Val Asn Arg Ile Leu Gly Tyr Arg His Phe Cys 595 600 605 Asn Lys Leu Trp Asn Ala Val Lys Phe Ala Met Arg Thr Leu Gly Glu 610 615 620 Gly Phe Val Pro Cys Glu Lys Ala Gln Leu Cys Gly Ser Glu Ser Val 625 630 635 640 Ser Asp Arg Trp Ile Leu Ser Arg Leu Ser Ala Ala Val Ala Leu Cys 645 650 655 Asp Gly Gly Phe Lys Ala Tyr Asp Phe Pro Thr Ile Thr Thr Ala Ile 660 665 670 Tyr Asn Phe Trp Leu Tyr Glu Leu Cys Asp Val Tyr Leu Glu Cys Val 675 680 685 Lys Pro Val Phe Ser Arg Thr Asp Ser Glu Gly Gln Lys Gln Ala Asp 690 695 700 Val Cys Arg Gln Thr Leu Tyr Thr Cys Leu Glu Val Gly Leu Arg Leu 705 710 715 720 Leu Ser Pro Ile Met Pro Phe Val Ser Glu Glu Leu Phe Gln Arg Leu 725 730 735 Pro Arg Arg Arg Pro Arg Asp Ser Leu Pro Ala Ser Leu Ser His Leu 740 745 750 Ser Glx His Cys Glx Val Leu Leu Ala Gln Glx Gly Cys Gly Pro Ser 755 760 765 Asp Gly Val Cys His Val Cys Gly Gln Asn His Ser Leu Thr Gln Tyr 770 775 780 Pro Asp Thr Ala Glu Phe Cys Trp His Ser Glu Asp Val Asp Arg Gln 785 790 795 800 Met Glu Phe Val Met Ser Val Val Arg Thr Ile Arg Ser Leu Arg Ala 805 810 815 Asp Tyr Asn Leu Thr Lys Thr Arg Ala Asp Cys Phe Leu Gln Cys Ile 820 825 830 Asp Ser Glu Thr Ala Ala Leu Val Gln Lys Tyr Ser Leu Gln Ile Gln 835 840 845 Thr Leu Ser Tyr Ser Gln Ala Val His Ser Val Val Gly Asp Ala Ala 850 855 860 Ile Pro Gln Gly Cys Ala Val Ala Ile Ala Ser Asp Lys Cys Thr Val 865 870 875 880 Asn Leu Met Leu Lys Gly Leu Ile Asp Leu Gly Lys Glu Val Thr Lys 885 890 895 Leu Thr Ala Lys Lys Gly Glu Leu Glu Lys Gln Met Glu Lys Met Lys 900 905 910 Glu Lys Met Thr Lys Asn Asp Tyr Lys Glu Lys Val Pro Val Lys Val 915 920 925 Gln Glu Ala Asp Ala Glu Lys Leu Arg Gln Ser Glu Ala Glu Leu Gln 930 935 940 Lys Val Lys Glu Ala Ile Glu Asn Phe Ser Lys Met Met 945 950 955 121 645 DNA Danio rerio 121 ctcgagtgcg ccattcacac acacgctgcg aagatggctg aagattggga ggctgcacca 60 gctgttgctg aaaccccaga gatcaaactc tttggcaagt ggagcacaga tgatgtacag 120 atcaatgaca tctccctgca ggattacatt gccgtgaagg agaaatacgc caaatacctc 180 cctcattcca gcggcagata cgcagccaag cgtttccgta aggctcagtg tcccattgtg 240 gagcgcgtca ctaactccat gatgatgcac ggacgcaaca acggcaagaa actgctgacc 300 gttcgcatcg tcaagcacgc ttttgagatc atccacctgc tcactggcga gaatcctctg 360 cagattctgg taaacgccat catcaacagt ggccctcgtg aggactccac ccgtatcgga 420 cgtgctggaa ccgtgaggag acaggctgtt gatgtgtccc ccctgcgcag agtcaaccag 480 gccatctggc tgctctgcac tggtgcaaga gaagctgctt tcaggaacat caagaccatc 540 gcagagtgtc ttgctgacga gctcatcaac gctgccaagg gttcctacaa ctcttacgcc 600 atcaagaaga aagatgagct ggagagagtg gccaagtcta accgc 645 122 204 PRT Danio rerio 122 Met Ala Glu Asp Trp Glu Ala Ala Pro Ala Val Ala Glu Thr Pro Glu 1 5 10 15 Ile Lys Leu Phe Gly Lys Trp Ser Thr Asp Asp Val Gln Ile Asn Asp 20 25 30 Ile Ser Leu Gln Asp Tyr Ile Ala Val Lys Glu Lys Tyr Ala Lys Tyr 35 40 45 Leu Pro His Ser Ser Gly Arg Tyr Ala Ala Lys Arg Phe Arg Lys Ala 50 55 60 Gln Cys Pro Ile Val Glu Arg Val Thr Asn Ser Met Met Met His Gly 65 70 75 80 Arg Asn Asn Gly Lys Lys Leu Leu Thr Val Arg Ile Val Lys His Ala 85 90 95 Phe Glu Ile Ile His Leu Leu Thr Gly Glu Asn Pro Leu Gln Ile Leu 100 105 110 Val Asn Ala Ile Ile Asn Ser Gly Pro Arg Glu Asp Ser Thr Arg Ile 115 120 125 Gly Arg Ala Gly Thr Val Arg Arg Gln Ala Val Asp Val Ser Pro Leu 130 135 140 Arg Arg Val Asn Gln Ala Ile Trp Leu Leu Cys Thr Gly Ala Arg Glu 145 150 155 160 Ala Ala Phe Arg Asn Ile Lys Thr Ile Ala Glu Cys Leu Ala Asp Glu 165 170 175 Leu Ile Asn Ala Ala Lys Gly Ser Ser Asn Ser Tyr Ala Ile Lys Lys 180 185 190 Lys Asp Glu Leu Glu Arg Val Ala Lys Ser Asn Arg 195 200 123 1106 DNA Danio rerio 123 cctccctgtg ctgttcttcc tgtcgcgtgc gggtctcctc agcacatcta atcctcatca 60 tggcgtctct atcgatggct ccggtcagca ttttcaagca tggagcggat gaggagaagg 120 cagaaaccgc tcgcctgtcg tccttcatcg gcgccatcgc catcggggat ctggtgaaga 180 gcactctggg gcccaaagga atggataaga tcctcctggg cgggggcaga gagagcaccg 240 tgacggtgac caatgacgga gccaccatcc tcaaggccat cggggtggac aaccctgctg 300 ctaaagtgct cgtggacatg tctaaggtcc aggatgatga ggttggtgac aggacgacct 360 cggtcactgt gctggctgct gagctgctgc gggaggcaga gctgctgatc gccaagaaga 420 ttcatcctca gatcatcatc gccggctgga ggaaggccac acaggcggcc cgcgacgccc 480 tcagagaggc cgcagtggat catgggagtg acgaggtgaa gttccaggag gatctgctga 540 acatctcccg caccacgctc tcctccaagc tgctgacgca ccataaggac cacttcgcca 600 agctggcggt gcaggcggtg atgcggctcc ggggatctgg caacctggag gccattcacc 660 tgatcaagaa gctgggcggc agcctcactg actcatacct ggacgagggg ttcctgctgg 720 acaagcgtat cggggtcaat cagcccaaac gcatcgagaa cgctaacatc ctcatcgcca 780 acaccggcat ggacaccgac aagatcaaga tcttcggctc cagagtgcgt gtggactcca 840 ctgctaaagt ggcagagatc gagacggcag agaaggagaa gatgaaggag aaggtggagc 900 gaatcctcaa acacggcatc aactgcttca tcaacaggca gctgatctac aattacccgg 960 agcaattgtt tgcggccgca ggcgtcatgg ctatcgaaca cgcagatttc acaggagtgg 1020 agcgtctggc tctcgtcaca ggtggggaaa tcacctccac ctttgaccac cccgactggt 1080 gaactgggcc actgtaagct gatcga 1106 124 340 PRT Danio rerio 124 Met Ala Ser Leu Ser Met Ala Pro Val Ser Ile Phe Lys His Gly Ala 1 5 10 15 Asp Glu Glu Lys Ala Glu Thr Ala Arg Leu Ser Ser Phe Ile Gly Ala 20 25 30 Ile Ala Ile Gly Asp Leu Val Lys Ser Thr Leu Gly Pro Lys Gly Met 35 40 45 Asp Lys Ile Leu Leu Gly Gly Gly Arg Glu Ser Thr Val Thr Val Thr 50 55 60 Asn Asp Gly Ala Thr Ile Leu Lys Ala Ile Gly Val Asp Asn Pro Ala 65 70 75 80 Ala Lys Val Leu Val Asp Met Ser Lys Val Gln Asp Asp Glu Val Gly 85 90 95 Asp Arg Thr Thr Ser Val Thr Val Leu Ala Ala Glu Leu Leu Arg Glu 100 105 110 Ala Glu Leu Leu Ile Ala Lys Lys Ile His Pro Gln Ile Ile Ile Ala 115 120 125 Gly Trp Arg Lys Ala Thr Gln Ala Ala Arg Asp Ala Leu Arg Glu Ala 130 135 140 Ala Val Asp His Gly Ser Asp Glu Val Lys Phe Gln Glu Asp Leu Leu 145 150 155 160 Asn Ile Ser Arg Thr Thr Leu Ser Ser Lys Leu Leu Thr His His Lys 165 170 175 Asp His Phe Ala Lys Leu Ala Val Gln Ala Val Met Arg Leu Arg Gly 180 185 190 Ser Gly Asn Leu Glu Ala Ile His Leu Ile Lys Lys Leu Gly Gly Ser 195 200 205 Leu Thr Asp Ser Tyr Leu Asp Glu Gly Phe Leu Leu Asp Lys Arg Ile 210 215 220 Gly Val Asn Gln Pro Lys Arg Ile Glu Asn Ala Asn Ile Leu Ile Ala 225 230 235 240 Asn Thr Gly Met Asp Thr Asp Lys Ile Lys Ile Phe Gly Ser Arg Val 245 250 255 Arg Val Asp Ser Thr Ala Lys Val Ala Glu Ile Glu Thr Ala Glu Lys 260 265 270 Glu Lys Met Lys Glu Lys Val Glu Arg Ile Leu Lys His Gly Ile Asn 275 280 285 Cys Phe Ile Asn Arg Gln Leu Ile Tyr Asn Tyr Pro Glu Gln Leu Phe 290 295 300 Ala Ala Ala Gly Val Met Ala Ile Glu His Ala Asp Phe Thr Gly Val 305 310 315 320 Glu Arg Leu Ala Leu Val Thr Gly Gly Glu Ile Thr Ser Thr Phe Asp 325 330 335 His Pro Asp Trp 340 125 2070 DNA Danio rerio 125 ggcaccgtcc tcctctattc cgcaatcatg atgtccactc cagtcatcct cttgaaagag 60 ggcacagaca cctctcaggg ggtcccacaa ctggtcagca acataaatgc ctgccaggtt 120 gtggcagagg ctgtgcggac cacccttggc ccccgtggca tggacaagct tgtggtggat 180 aaccgaggca aagccactat ttctaatgat ggagccacaa ttctgaagct tttggatgtt 240 gtgcatcctg cagccaagac tctggtggac attgctagat ctcaagatgc tgaggtcgga 300 gatggtacca cttcagtgac tctgcttgct gctgagtttc tgaagcagtt gaaaccgtat 360 gtggaagaag ggcttcaccc acagaccatc atcagagcat tccgcatcgc cacccaactt 420 gctgtcaaaa agatcaaaga aatcgctgtt accatcaaaa aggatgacaa acaagaacag 480 aggaggttgt tggagaagtg tgctgctaca gctttgaact ccaagctgat agcagggcag 540 aaggatttct tctccaagat ggtggtggat gcagtgatga tgctggatga tctgctgcct 600 ctgaagatga ttggagtgaa gaaggtgcag ggtggtgctc tggaggagtc tcagcttgtg 660 gctggtgtgg catttaagaa gactttctct tatgctggtt ttgagatgca gcccaagcgt 720 tacatgaacc caaaaattgc cctgctcaac attgagctgg agttgaaggc agagaaggac 780 aatgccgagg ttcgcgtcaa ctcaatggag gactatcagg ccattgttga tgctgaatgg 840 aacatcctgt atgataaact ggagaagatc cacaaatctg gcgctaaagt tgtgctgtcc 900 aagctgccca ttggagatgt agccacacag tactttgcat acagaaatct gttctgtgca 960 gcccgcgtcg tggaggaaaa tctcaaaaga actatgatgg cttgtggtgg ctccattcag 1020 accagtgttg gttccctgac tgatgatgtt cttggccagt gtgagctatt tgaagaagtg 1080 caggttggag gagagagata caatttcttt aaaggctgcc caaaggccaa gacctgcacc 1140 atcattctga ggggtggtgc agagcagttt atggaagaga cggaccgctc actgcatgat 1200 gccattatga tagtgcgcag ggcaatcaag aatgactcca ttgttgccgg tggtggggca 1260 attgagatgg agttgtcgaa gtatctgagg gattattcta gaacaattcc agggaagcag 1320 caattgctga tcggagccta tgccaaggcc ctggagatca ttcccagaca gctctgtgac 1380 aatgcgggat ttgatgccac aaatattcta aacaaactga gggccaagca tgcacagggt 1440 ggtatgtggt atggagtgga tgtgaataat gaagacatag cagataactt ccaggcatgt 1500 gtttgggagc cctctatagt gcgtatcaat gccttgactg ctgcatctga agctgcatgc 1560 ctcatactgt cagtggatga gaccatcaag aaccctcgct ccagtgttga tggcccaccg 1620 gcagctgctg gcagaggaag gggtcgtggt agaccagccc atgcccacta atccatatcc 1680 agccttagga ctttcagttg atcacaacaa atctttgccg ttcacttggt tttgatttgt 1740 tttattggcc cgttttgtta tttaccagtc ccactaacta cgagaatctc tgtgtttgca 1800 aaggctaatg cagaataaaa tggcgttttt ttttctctct ccttgtttta gaccaatggg 1860 aactgacatc ttttctattg ggtatttgga attttgaatg ttgttttgta atagtctgag 1920 aacacacaaa aaacaccatg tgctcaaaat tggttggttt gtagatctga gtaagcactg 1980 aaatgcgggc actctgctga gatatgttcc tgtgttggtc agttggtgga actatttatt 2040 gtaccattaa aatcagtact tttgcctcaa 2070 126 547 PRT Danio rerio 126 Met Met Ser Thr Pro Val Ile Leu Leu Lys Glu Gly Thr Asp Thr Ser 1 5 10 15 Gln Gly Val Pro Gln Leu Val Ser Asn Ile Asn Ala Cys Gln Val Val 20 25 30 Ala Glu Ala Val Arg Thr Thr Leu Gly Pro Arg Gly Met Asp Lys Leu 35 40 45 Val Val Asp Asn Arg Gly Lys Ala Thr Ile Ser Asn Asp Gly Ala Thr 50 55 60 Ile Leu Lys Leu Leu Asp Val Val His Pro Ala Ala Lys Thr Leu Val 65 70 75 80 Asp Ile Ala Arg Ser Gln Asp Ala Glu Val Gly Asp Gly Thr Thr Ser 85 90 95 Val Thr Leu Leu Ala Ala Glu Phe Leu Lys Gln Leu Lys Pro Tyr Val 100 105 110 Glu Glu Gly Leu His Pro Gln Thr Ile Ile Arg Ala Phe Arg Ile Ala 115 120 125 Thr Gln Leu Ala Val Lys Lys Ile Lys Glu Ile Ala Val Thr Ile Lys 130 135 140 Lys Asp Asp Lys Gln Glu Gln Arg Arg Leu Leu Glu Lys Cys Ala Ala 145 150 155 160 Thr Ala Leu Asn Ser Lys Leu Ile Ala Gly Gln Lys Asp Phe Phe Ser 165 170 175 Lys Met Val Val Asp Ala Val Met Met Leu Asp Asp Leu Leu Pro Leu 180 185 190 Lys Met Ile Gly Val Lys Lys Val Gln Gly Gly Ala Leu Glu Glu Ser 195 200 205 Gln Leu Val Ala Gly Val Ala Phe Lys Lys Thr Phe Ser Tyr Ala Gly 210 215 220 Phe Glu Met Gln Pro Lys Arg Tyr Met Asn Pro Lys Ile Ala Leu Leu 225 230 235 240 Asn Ile Glu Leu Glu Leu Lys Ala Glu Lys Asp Asn Ala Glu Val Arg 245 250 255 Val Asn Ser Met Glu Asp Tyr Gln Ala Ile Val Asp Ala Glu Trp Asn 260 265 270 Ile Leu Tyr Asp Lys Leu Glu Lys Ile His Lys Ser Gly Ala Lys Val 275 280 285 Val Leu Ser Lys Leu Pro Ile Gly Asp Val Ala Thr Gln Tyr Phe Ala 290 295 300 Tyr Arg Asn Leu Phe Cys Ala Ala Arg Val Val Glu Glu Asn Leu Lys 305 310 315 320 Arg Thr Met Met Ala Cys Gly Gly Ser Ile Gln Thr Ser Val Gly Ser 325 330 335 Leu Thr Asp Asp Val Leu Gly Gln Cys Glu Leu Phe Glu Glu Val Gln 340 345 350 Val Gly Gly Glu Arg Tyr Asn Phe Phe Lys Gly Cys Pro Lys Ala Lys 355 360 365 Thr Cys Thr Ile Ile Leu Arg Gly Gly Ala Glu Gln Phe Met Glu Glu 370 375 380 Thr Asp Arg Ser Leu His Asp Ala Ile Met Ile Val Arg Arg Ala Ile 385 390 395 400 Lys Asn Asp Ser Ile Val Ala Gly Gly Gly Ala Ile Glu Met Glu Leu 405 410 415 Ser Lys Tyr Leu Arg Asp Tyr Ser Arg Thr Ile Pro Gly Lys Gln Gln 420 425 430 Leu Leu Ile Gly Ala Tyr Ala Lys Ala Leu Glu Ile Ile Pro Arg Gln 435 440 445 Leu Cys Asp Asn Ala Gly Phe Asp Ala Thr Asn Ile Leu Asn Lys Leu 450 455 460 Arg Ala Lys His Ala Gln Gly Gly Met Trp Tyr Gly Val Asp Val Asn 465 470 475 480 Asn Glu Asp Ile Ala Asp Asn Phe Gln Ala Cys Val Trp Glu Pro Ser 485 490 495 Ile Val Arg Ile Asn Ala Leu Thr Ala Ala Ser Glu Ala Ala Cys Leu 500 505 510 Ile Leu Ser Val Asp Glu Thr Ile Lys Asn Pro Arg Ser Ser Val Asp 515 520 525 Gly Pro Pro Ala Ala Ala Gly Arg Gly Arg Gly Arg Gly Arg Pro Ala 530 535 540 His Ala His 545 127 1753 DNA Danio rerio 127 cttcctcttt ctgttacctg gcaaagggga gcagcagctg aggagtgatc tctcaatctt 60 gaaacttatc aatcatggga aaggaaaaga cccacattaa catcgtggtt attggccacg 120 tcgactccgg aaagtccacc accaccggcc atctgatcta caaatgcggt ggaatcgaca 180 agagaaccat cgagaagttc gagaaggaag ccgctgagat gggcaagggc tccttcaagt 240 acgcctgggt gttggacaaa ctgaaggccg agcgtgagcg tggtatcacc attgacattg 300 ctctctggaa attcgagacc agcaaatact acgtcaccat tattgatgcc cctggacaca 360 gagacttcat caagaacatg atcactggta cttctcaggc tgactgtgct gtgctgattg 420 ttgctggtgg tgtcggtgag tttgaggctg gtatctccaa gaacggacag acccgtgagc 480 acgccctcct ggctttcacc ctgggagtga aacagctgat cgttggagtc aacaagatgg 540 actccactga gcccccttac agccaggctc gttttgagga aatcaccaag gaagtcagcg 600 catacatcaa gaagatcggc tacaaccctg ccagtgttgc cttcgtccca atttcaggat 660 ggcacggtga caacatgctg gaggccagct caaacatggg ctggttcaag ggatggaaga 720 ttgagcgcaa ggagggtaat gctagcggta ctactcttct tgatgccctt gatgccattc 780 tgccccctag ccgtcccacc gacaagcccc tccgtctgcc acttcaggat gtgtacaaaa 840 ttggaggtat tggaactgta cctgtgggtc gtgtggagac tggtgtcctc aagcctggta 900 tggttgtgac cttcgcccct gccaatgtaa ccactgaggt caagtctgtt gagatgcacc 960 acgagtctct gactgaggcc actcctggtg acaacgttgg cttcaacgtt aagaacgtgt 1020 ctgtcaagga catccgtcgt ggtaatgtgg ctggagacag caagaacgac ccacccatgg 1080 aggctgccaa cttcaacgct caggtcatca tcctgaacca ccctggtcag atctctcagg 1140 gttacgcccc agtgctggat tgccacactg ctcacatcgc ctgcaagttt gctgagctca 1200 aggagaagat cgaccgtcgt tctggcaaga agcttgaaga caaccccaag gctctcaaat 1260 ccggagatgc cgccattgtt gagatggtcc ctggcaagcc catgtgtgtg gagagcttct 1320 ctacctaccc tcctcttggt cgctttgctg tgcgtgacat gaggcagacc gttgctgtcg 1380 gcgtcatcaa gagcgttgag aagaaaatcg gtggtgctgg caaggtcaca aagtctgcac 1440 agaaggctgc caagaccaag tgaatttccc tcaatcacac cgttccaaag gttgcggcgt 1500 gttcttccca acctcttgga atttctctaa acctgggcac tctacttaag gactggataa 1560 tgctgattaa aacccatcgg aaaaattttc gcaggaaagg aaaacaactt ggatttaagt 1620 gtggctccat ttattgactg atagtgcctc tttcagttat taaatttgtg ttttgatggt 1680 ttagaactgc acctgttgcc acagtacaat ttggaaacgc tgatgaataa actaataaag 1740 gtattaaaaa ttg 1753 128 462 PRT Danio rerio 128 Met Gly Lys Glu Lys Thr His Ile Asn Ile Val Val Ile Gly His Val 1 5 10 15 Asp Ser Gly Lys Ser Thr Thr Thr Gly His Leu Ile Tyr Lys Cys Gly 20 25 30 Gly Ile Asp Lys Arg Thr Ile Glu Lys Phe Glu Lys Glu Ala Ala Glu 35 40 45 Met Gly Lys Gly Ser Phe Lys Tyr Ala Trp Val Leu Asp Lys Leu Lys 50 55 60 Ala Glu Arg Glu Arg Gly Ile Thr Ile Asp Ile Ala Leu Trp Lys Phe 65 70 75 80 Glu Thr Ser Lys Tyr Tyr Val Thr Ile Ile Asp Ala Pro Gly His Arg 85 90 95 Asp Phe Ile Lys Asn Met Ile Thr Gly Thr Ser Gln Ala Asp Cys Ala 100 105 110 Val Leu Ile Val Ala Gly Gly Val Gly Glu Phe Glu Ala Gly Ile Ser 115 120 125 Lys Asn Gly Gln Thr Arg Glu His Ala Leu Leu Ala Phe Thr Leu Gly 130 135 140 Val Lys Gln Leu Ile Val Gly Val Asn Lys Met Asp Ser Thr Glu Pro 145 150 155 160 Pro Tyr Ser Gln Ala Arg Phe Glu Glu Ile Thr Lys Glu Val Ser Ala 165 170 175 Tyr Ile Lys Lys Ile Gly Tyr Asn Pro Ala Ser Val Ala Phe Val Pro 180 185 190 Ile Ser Gly Trp His Gly Asp Asn Met Leu Glu Ala Ser Ser Asn Met 195 200 205 Gly Trp Phe Lys Gly Trp Lys Ile Glu Arg Lys Glu Gly Asn Ala Ser 210 215 220 Gly Thr Thr Leu Leu Asp Ala Leu Asp Ala Ile Leu Pro Pro Ser Arg 225 230 235 240 Pro Thr Asp Lys Pro Leu Arg Leu Pro Leu Gln Asp Val Tyr Lys Ile 245 250 255 Gly Gly Ile Gly Thr Val Pro Val Gly Arg Val Glu Thr Gly Val Leu 260 265 270 Lys Pro Gly Met Val Val Thr Phe Ala Pro Ala Asn Val Thr Thr Glu 275 280 285 Val Lys Ser Val Glu Met His His Glu Ser Leu Thr Glu Ala Thr Pro 290 295 300 Gly Asp Asn Val Gly Phe Asn Val Lys Asn Val Ser Val Lys Asp Ile 305 310 315 320 Arg Arg Gly Asn Val Ala Gly Asp Ser Lys Asn Asp Pro Pro Met Glu 325 330 335 Ala Ala Asn Phe Asn Ala Gln Val Ile Ile Leu Asn His Pro Gly Gln 340 345 350 Ile Ser Gln Gly Tyr Ala Pro Val Leu Asp Cys His Thr Ala His Ile 355 360 365 Ala Cys Lys Phe Ala Glu Leu Lys Glu Lys Ile Asp Arg Arg Ser Gly 370 375 380 Lys Lys Leu Glu Asp Asn Pro Lys Ala Leu Lys Ser Gly Asp Ala Ala 385 390 395 400 Ile Val Glu Met Val Pro Gly Lys Pro Met Cys Val Glu Ser Phe Ser 405 410 415 Thr Tyr Pro Pro Leu Gly Arg Phe Ala Val Arg Asp Met Arg Gln Thr 420 425 430 Val Ala Val Gly Val Ile Lys Ser Val Glu Lys Lys Ile Gly Gly Ala 435 440 445 Gly Lys Val Thr Lys Ser Ala Gln Lys Ala Ala Lys Thr Lys 450 455 460 129 1891 DNA Danio rerio 129 acgcggggat cccataatgc acctcgtcgc tgtcatggct gctcactgat gacacttgcg 60 gcacatgggc agaaatctaa cacattacag tgaacacacg aacctcaaca cgagtctcac 120 actcacacag agagcaggtc ctgagcgacg cgacgctgtt tcattctccg cggaaccggg 180 cacacaagga gcgcgatgct gcagctcgag gcggcggttc tgggcggatt gagtctggac 240 tggaaacaga gcagcttcag caccgcagat gcaggttcat tgctgcagtt ggtatatgag 300 ggagattttg aggccgtcct cttcagctct gcggtgcagg ggcttctggg agtttgtcct 360 gaagagggtg acagcattga ggcctatctg gagagacagg tgctgtctta cctgagtgat 420 gccaccgagg agcagaggag ccacagggaa gttgctctgt tggcggtggc agtggggtgt 480 ctgaaccttt tcgcccagag taactggacc ggcccaccag ttgagctcca tgtctctgat 540 gttcttcctg aagccctgct ccagaacttt tctcagcctg cggccctgag tacagctctg 600 ttgtccagtc ttcagctgga tggagagtct gtgtacagtc tggtctccaa ccctctcctg 660 ctgctgctga ccagagtcat cttcgtcaac tgcgccgcca aactggagag ccttcagctg 720 ctgccctggt ggacgctgcg atatgtcgct ctccatcagc agatactgga ggaaagatcc 780 ccacagctcc tcaacctgat gctcagctgc atcgagaaag tgtgtaaatg tgacgagctg 840 ttcagcagta atgcacacag aaacctggcc attcagttcc acctggagtg cagttatatc 900 tgcctgacct actatgagta tcacaaggcc aaagaacatc tgcagacggc tcgtgatctg 960 tcaggactcg atgttaacat gacaggtgct ctagggaagc gaactcgttt tcaggagaac 1020 tttttggctc agctcatctt agatgtaaag cgaaaggaca actcgccctt ttacaactcc 1080 gagagtccct ccagtacacc aacacctaaa gagctcctac caaaggacca tcagctgagc 1140 gacgatacag tcctaaacca gattaatctt gcagaaccat ctgagcatga acttccagac 1200 ctaagcgctg aggagcaagc cttcatgctg gccacatgca cagactttcg gaagaataat 1260 ccagttcaca agctgaatga tgaggagctc ctggccttca ccacggtaag acacattgtt 1320 attctctaac aggccgcagt gcagttgaca catctcttat cagcacacac agttgtgttt 1380 tcaggttgac agatggaggg catgcaaaca ggtctctctc aattgctcac agtttagcac 1440 ctagaatgtg tgtgtttgct ctaacaaatg caaacaaagg aaaatggtct gtggcttctg 1500 ctgttgtgct ccacaaaaat gaaaagatgt caatctctgg tcatttctgc tgctctgctc 1560 aaatctacaa tgtatagttc ttctcaaaag ttgacatgcc cttcgcaaaa tctccaagat 1620 gattatcatt tttaaataag caaacatgag cattaggaaa agttgatcga ctatctctca 1680 tgccaagtgc tgtcggaatt ttaaatggaa accattggtt tattttttta tacatttata 1740 ttatatgatg aatttgagac aattttccac agtatgaata ataaagtaat tattaaatgc 1800 aaattttact ttttttttct tcttcaattt tgtgctttat aataattata gcaccgacac 1860 aacagcaaaa aaaaaaaaaa aaaaaaaaaa a 1891 130 377 PRT Danio rerio 130 Met Leu Gln Leu Glu Ala Ala Val Leu Gly Gly Leu Ser Leu Asp Trp 1 5 10 15 Lys Gln Ser Ser Phe Ser Thr Ala Asp Ala Gly Ser Leu Leu Gln Leu 20 25 30 Val Tyr Glu Gly Asp Phe Glu Ala Val Leu Phe Ser Ser Ala Val Gln 35 40 45 Gly Leu Leu Gly Val Cys Pro Glu Glu Gly Asp Ser Ile Glu Ala Tyr 50 55 60 Leu Glu Arg Gln Val Leu Ser Tyr Leu Ser Asp Ala Thr Glu Glu Gln 65 70 75 80 Arg Ser His Arg Glu Val Ala Leu Leu Ala Val Ala Val Gly Cys Leu 85 90 95 Asn Leu Phe Ala Gln Ser Asn Trp Thr Gly Pro Pro Val Glu Leu His 100 105 110 Val Ser Asp Val Leu Pro Glu Ala Leu Leu Gln Asn Phe Ser Gln Pro 115 120 125 Ala Ala Leu Ser Thr Ala Leu Leu Ser Ser Leu Gln Leu Asp Gly Glu 130 135 140 Ser Val Tyr Ser Leu Val Ser Asn Pro Leu Leu Leu Leu Leu Thr Arg 145 150 155 160 Val Ile Phe Val Asn Cys Ala Ala Lys Leu Glu Ser Leu Gln Leu Leu 165 170 175 Pro Trp Trp Thr Leu Arg Tyr Val Ala Leu His Gln Gln Ile Leu Glu 180 185 190 Glu Arg Ser Pro Gln Leu Leu Asn Leu Met Leu Ser Cys Ile Glu Lys 195 200 205 Val Cys Lys Cys Asp Glu Leu Phe Ser Ser Asn Ala His Arg Asn Leu 210 215 220 Ala Ile Gln Phe His Leu Glu Cys Ser Tyr Ile Cys Leu Thr Tyr Tyr 225 230 235 240 Glu Tyr His Lys Ala Lys Glu His Leu Gln Thr Ala Arg Asp Leu Ser 245 250 255 Gly Leu Asp Val Asn Met Thr Gly Ala Leu Gly Lys Arg Thr Arg Phe 260 265 270 Gln Glu Asn Phe Leu Ala Gln Leu Ile Leu Asp Val Lys Arg Lys Asp 275 280 285 Asn Ser Pro Phe Tyr Asn Ser Glu Ser Pro Ser Ser Thr Pro Thr Pro 290 295 300 Lys Glu Leu Leu Pro Lys Asp His Gln Leu Ser Asp Asp Thr Val Leu 305 310 315 320 Asn Gln Ile Asn Leu Ala Glu Pro Ser Glu His Glu Leu Pro Asp Leu 325 330 335 Ser Ala Glu Glu Gln Ala Phe Met Leu Ala Thr Cys Thr Asp Phe Arg 340 345 350 Lys Asn Asn Pro Val His Lys Leu Asn Asp Glu Glu Leu Leu Ala Phe 355 360 365 Thr Thr Val Arg His Ile Val Ile Leu 370 375 131 565 DNA Danio rerio 131 acgcgggagc agcggtaaca acgcagagta cgcggggctc tttcccgtct gaaccgccgc 60 catgaaggtc gagctgtgca gttttagcgg atacaaaatc tatcccggtc acggccggcg 120 atacgccagg gttgacggaa aggttttcca gttcctcaat gccaaatgtg agtctgcgtt 180 tctgtccaag aggaacccga ggcagatcaa ctggaccgtt ctgtaccgcc gcaagcacaa 240 gaagggccag tctgaggagg tgtcgaagaa gcgcacacgt cgtgcagtaa agttccagcg 300 ggcgatcacc ggcgcttctc tggccgagat tctggccaag aggaaccaga agcctgaagt 360 gcgtaaagcc cagcgcgagc aggccatcag ggctgcaaag gaggcgaaga aagccaagca 420 ggccaccaag aagcagacga cccagagcag caaggctcct gctaaatcag ctcagaaaca 480 gaagattgcc aagcccatga aggtcagcgc tcctcgcgtc ggtggcaaac gctaaactct 540 gatgtgctaa taaagattca ctgct 565 132 157 PRT Danio rerio 132 Met Lys Val Glu Leu Cys Ser Phe Ser Gly Tyr Lys Ile Tyr Pro Gly 1 5 10 15 His Gly Arg Arg Tyr Ala Arg Val Asp Gly Lys Val Phe Gln Phe Leu 20 25 30 Asn Ala Lys Cys Glu Ser Ala Phe Leu Ser Lys Arg Asn Pro Arg Gln 35 40 45 Ile Asn Trp Thr Val Leu Tyr Arg Arg Lys His Lys Lys Gly Gln Ser 50 55 60 Glu Glu Val Ser Lys Lys Arg Thr Arg Arg Ala Val Lys Phe Gln Arg 65 70 75 80 Ala Ile Thr Gly Ala Ser Leu Ala Glu Ile Leu Ala Lys Arg Asn Gln 85 90 95 Lys Pro Glu Val Arg Lys Ala Gln Arg Glu Gln Ala Ile Arg Ala Ala 100 105 110 Lys Glu Ala Lys Lys Ala Lys Gln Ala Thr Lys Lys Gln Thr Thr Gln 115 120 125 Ser Ser Lys Ala Pro Ala Lys Ser Ala Gln Lys Gln Lys Ile Ala Lys 130 135 140 Pro Met Lys Val Ser Ala Pro Arg Val Gly Gly Lys Arg 145 150 155 133 1735 DNA Danio rerio 133 gcagcggaca gcacaatgtc taatggtcag gaaacggcgt gtctaaacgg cgaaccggcg 60 ctcaaacgct ccagagacag tctggattca ctgcgggtcc cgcgggagcc ccagaataaa 120 gtgaccgtag tgctgggcgc gcagtggggc gatgagggga aggggaaagt ggtcgacctc 180 ttggcaatgg acgccgacat tgtatgcaga tgccagggtg gaaatagcgc gggacacact 240 gtggtcgtgg actcggtcga gtatgacttc cacctgctgc ccagcggagt tcttaacaaa 300 aaggccactt cctttattgg aaatggtgtt gtgatacacc tgccaggact ctttgacgag 360 gctgagaaga acttgcagaa gggcaatgga cttcaaggat gggaggaacg actgaagata 420 tctgaccggg cacacattgt gttcaacttc caccaggctg ttgatggcat acaggagcag 480 ctcagacaac agcaggcagg aaaaaatttg ggcacaacta agaagggtat tggacctgca 540 tactcttcca aagcagcacg taatggactg agagtgtgtg atctggtctc cgacttctct 600 gtctttgaag aaaagtttcg ggtgctggct ggacattatc agaccactta tcctaatctt 660 aacattgaca ttgatgctga gcttcagcag ctcaagaatt ttgctgaaag attacggcca 720 ctagtgactg atggggttta tttcatgcac aaagccctca ctggaccaag caagaagatt 780 ctggtggaag gagccaatgc tgctttactg gatattgatt ttgggaccta cccctttgtg 840 acctcatcaa actgcactgt tggaggcgtc tgcacaggtc ttggcatccc cccatctcat 900 gttggccgtg tgtatggagt ggtgaaggcc tacaccacca gggttggagt cggggcattt 960 cccactgagc agaacaatga aattggagat ctgctgcaga gcagagggag ggaatttgga 1020 gttacaacag gcaggcggcg gcgctgtggg tggctggatc tggttttggt tcgctatgcc 1080 cacatggtta acgggttttc agccattgcg ctgactaagt tggacattct tgacacattg 1140 cctgaaataa agattggcac agcctacaca gttgatggaa agcctcttcc cagttttcct 1200 gcaaacatgg acatgctcac aaaggttcag gtgacttatg agacgttccc tggctggagt 1260 tgtagcactg agggagtgcg cagcttcgat gagctgccta cacaggcaca aagctacatt 1320 aaatttattg aggattttct gcaagtgcca gtgaagtggg ttggagttgg caagtcaaga 1380 gaaagcatga tcaagctgtt ttgacgaggg cgaagggaca ggctgagctg atggtcatcg 1440 actggtctgg cacgacattt atcatcatga ccgctggagc acctgaactg gcttttatgc 1500 atcagcgctg agatgaagac taaccgtggg tttagaatcc actgaatgtc ccactcaaag 1560 tgaacaccag cattgtgtat aggggtcaga tttgagcacc agacctgctt cagtgcattt 1620 gtaattgctt gattcgttcc tgtttgtttc caaagcgcat cccaaacctg ctcaaaacat 1680 acctctgcta atcagtaata gtctgccggt aaaagtaatg agctcccaac tgaaa 1735 134 462 PRT Danio rerio 134 Met Ser Asn Gly Gln Glu Thr Ala Cys Leu Asn Gly Glu Pro Ala Leu 1 5 10 15 Lys Arg Ser Arg Asp Ser Leu Asp Ser Leu Arg Val Pro Arg Glu Pro 20 25 30 Gln Asn Lys Val Thr Val Val Leu Gly Ala Gln Trp Gly Asp Glu Gly 35 40 45 Lys Gly Lys Val Val Asp Leu Leu Ala Met Asp Ala Asp Ile Val Cys 50 55 60 Arg Cys Gln Gly Gly Asn Ser Ala Gly His Thr Val Val Val Asp Ser 65 70 75 80 Val Glu Tyr Asp Phe His Leu Leu Pro Ser Gly Val Leu Asn Lys Lys 85 90 95 Ala Thr Ser Phe Ile Gly Asn Gly Val Val Ile His Leu Pro Gly Leu 100 105 110 Phe Asp Glu Ala Glu Lys Asn Leu Gln Lys Gly Asn Gly Leu Gln Gly 115 120 125 Trp Glu Glu Arg Leu Lys Ile Ser Asp Arg Ala His Ile Val Phe Asn 130 135 140 Phe His Gln Ala Val Asp Gly Ile Gln Glu Gln Leu Arg Gln Gln Gln 145 150 155 160 Ala Gly Lys Asn Leu Gly Thr Thr Lys Lys Gly Ile Gly Pro Ala Tyr 165 170 175 Ser Ser Lys Ala Ala Arg Asn Gly Leu Arg Val Cys Asp Leu Val Ser 180 185 190 Asp Phe Ser Val Phe Glu Glu Lys Phe Arg Val Leu Ala Gly His Tyr 195 200 205 Gln Thr Thr Tyr Pro Asn Leu Asn Ile Asp Ile Asp Ala Glu Leu Gln 210 215 220 Gln Leu Lys Asn Phe Ala Glu Arg Leu Arg Pro Leu Val Thr Asp Gly 225 230 235 240 Val Tyr Phe Met His Lys Ala Leu Thr Gly Pro Ser Lys Lys Ile Leu 245 250 255 Val Glu Gly Ala Asn Ala Ala Leu Leu Asp Ile Asp Phe Gly Thr Tyr 260 265 270 Pro Phe Val Thr Ser Ser Asn Cys Thr Val Gly Gly Val Cys Thr Gly 275 280 285 Leu Gly Ile Pro Pro Ser His Val Gly Arg Val Tyr Gly Val Val Lys 290 295 300 Ala Tyr Thr Thr Arg Val Gly Val Gly Ala Phe Pro Thr Glu Gln Asn 305 310 315 320 Asn Glu Ile Gly Asp Leu Leu Gln Ser Arg Gly Arg Glu Phe Gly Val 325 330 335 Thr Thr Gly Arg Arg Arg Arg Cys Gly Trp Leu Asp Leu Val Leu Val 340 345 350 Arg Tyr Ala His Met Val Asn Gly Phe Ser Ala Ile Ala Leu Thr Lys 355 360 365 Leu Asp Ile Leu Asp Thr Leu Pro Glu Ile Lys Ile Gly Thr Ala Tyr 370 375 380 Thr Val Asp Gly Lys Pro Leu Pro Ser Phe Pro Ala Asn Met Asp Met 385 390 395 400 Leu Thr Lys Val Gln Val Thr Tyr Glu Thr Phe Pro Gly Trp Ser Cys 405 410 415 Ser Thr Glu Gly Val Arg Ser Phe Asp Glu Leu Pro Thr Gln Ala Gln 420 425 430 Ser Tyr Ile Lys Phe Ile Glu Asp Phe Leu Gln Val Pro Val Lys Trp 435 440 445 Val Gly Val Gly Lys Ser Arg Glu Ser Met Ile Lys Leu Phe 450 455 460 135 730 DNA Danio rerio 135 gtcgtgctga tttaaacgcg tccagcttca cttttcctca attaatttgc ttttttcaca 60 atgtccataa aaggtaaatg ctcttttcag cgacttctat gttgatgtga gccagtacag 120 ggatcagcat tttaagggca accgttacga gcaggagaag ctcctcaaac agagcgccac 180 gctgtatgta ggaaacctgt ccttctacac cacagaggag caggtgcatg tgctcttcgc 240 caagtgtgga gacgtcaaga ggatcatcat cggcctcgac aaaatcaaga agactgcgtg 300 cggcttctgc ttcgtcgaat attacactcg tgcggacgct gagaacgcca tgaggtttgt 360 caacggcacg cgtcttgacg accgcatcat cagaacagac tgggacgcag gcttcaaaga 420 gggccgacag tacggacggg ggaaatctgg tggtcaggtc agagatgaat acaggcagga 480 ctatgatccc gccagagggg gttatgggaa gatggtgcag aaatcttgaa catccctttc 540 gtcattttcc ttttcttttt ttttaatctt ttaaaatatt tccacgttat gaatgaatct 600 ggcaggatta cggctgtttt ctgaatgtac atatcggagt ttgctttagg aatcgctcaa 660 gttcttctgg tccagcatga tgagtttaat aaattgtgtt ttgtttccca aaaaaaaaaa 720 aaaaaaaaaa 730 136 145 PRT Danio rerio 136 Asp Phe Tyr Val Asp Val Ser Gln Tyr Arg Asp Gln His Phe Lys Gly 1 5 10 15 Asn Arg Tyr Glu Gln Glu Lys Leu Leu Lys Gln Ser Ala Thr Leu Tyr 20 25 30 Val Gly Asn Leu Ser Phe Tyr Thr Thr Glu Glu Gln Val His Val Leu 35 40 45 Phe Ala Lys Cys Gly Asp Val Lys Arg Ile Ile Ile Gly Leu Asp Lys 50 55 60 Ile Lys Lys Thr Ala Cys Gly Phe Cys Phe Val Glu Tyr Tyr Thr Arg 65 70 75 80 Ala Asp Ala Glu Asn Ala Met Arg Phe Val Asn Gly Thr Arg Leu Asp 85 90 95 Asp Arg Ile Ile Arg Thr Asp Trp Asp Ala Gly Phe Lys Glu Gly Arg 100 105 110 Gln Tyr Gly Arg Gly Lys Ser Gly Gly Gln Val Arg Asp Glu Tyr Arg 115 120 125 Gln Asp Tyr Asp Pro Ala Arg Gly Gly Tyr Gly Lys Met Val Gln Lys 130 135 140 Ser 145 137 2106 DNA Danio rerio 137 gcaggtcacg ctagcccgtc gagccaggag aaatacagca tttttcattt ccagcactct 60 ggtctaatgg gaacactctt tgtaaatgcg ttttaagatt cttgcttgag agaagtacac 120 accggccctg gccttgaggc ttatctgaaa ggtttcaact tcactggtga acctccttgg 180 ctgtctgaca tctgttagcc acgtttcaat cccatctctt ccattcgtct gtttcttttg 240 caatcaaatc ttgaacaaac actatggcgg cttgcaccgg atcagacttt gacttcgcct 300 tcctggagga gggattctgt gcgcgcgata tcgtggagca gaagatcaat gagtcctcgc 360 tctcagatga taaagatgct ttctatgtgg cggatctggg tgatgtcctg aagaagcacc 420 tccgctggct gcgtgttttg ccccgcatca cgccgtttta cgcagtgaag tgcaacgata 480 gcagggcggt ggtcaccaca ctggcgtctc tgggagccgg gttcgactgt gccagcaaaa 540 cggagatcca gatcgtgcag tcagtgggtg tggatcccag caggatcatc tacgccaacc 600 cgtgcaagca ggtttcgcag atcaaatacg cctctgctca cggcgtgcag atgatgacgt 660 tcgacagcga ggtggagctc atgaaggtgg cccgtagcca cgaaaatgcc aagctggttc 720 tccgcatcgc cacagacgac tcgaaggctg tttgcaggct gagtgtgaag tttggagcga 780 cgctgaagag cagccggctg ctgctggaga gagcaaaaga gctcgggctg gacgtgatcg 840 gggtcagttt ccatgtgggc agcggctgta cggacccgga aacatacagc caggccatct 900 cagatgctcg ctatgttttc gacattgggg cggagctggg ctataacatg agcctgctgg 960 acatcggcgg aggatttcca ggctctgagg acaccaaact caagtttgaa gagattgctg 1020 ctgttattaa ccctgcgctg gataaatact tccctgtgga ctctggcgtg aggatcatcg 1080 ctgaacccgg ccgttattat gtagcatctg cttacacact ggcggtcaac atcatcgcca 1140 aaaaggtcat catgaaggag cagtcagcct ccgatgaaga agaggatggg tccaatgacc 1200 gaaccctgat gtactacgtg aatgacggcg tttatggctc cttcaactgc atcctgtatg 1260 atcatgcgca cgtgttgcca actctgcaca agaaacccaa gcccgatgag cgcatgtacc 1320 cctgcagtat ctggggcccg acctgcgacg gtctggaccg cattgtggag cagtgcagcc 1380 tgccggacat gcaggtgggc gactggctgc tgttcgagaa catgggcgcc tacaccgtcg 1440 ccgcatcctc caccttcaat ggcttcaaga aaccgatatt ttattacatc atgtcccgaa 1500 cagcctggca gtgcatgcag cagattcgtg cccagcggat tcctgcactt ccattggagg 1560 agccgagcgc tggaaacgtg ccatcccact gcgggcgcga gagcagtctg gatgttcccg 1620 ccaaaccctg cccgactcaa gtgctgtagc agaagacgct caaccaaacc tgctctcggg 1680 tccagctaat gcagatctgt gttactgatc tccaacttct gatctaaacg cctcagagct 1740 gaggagaaaa caaacgcaca gtccaattct tatttcctgt tatctttgtt ttaactggtt 1800 tttgttgatt gtctgtattt gtatttcgag gccagatgct tgagcgagga cttcttcctc 1860 ttcctcagtg accatttcca ttttatactt gattagtaac ttaatttttt aatattgagt 1920 tgaagtcttt gtattataat tattcagagc tgagcttgta tgccactggt gcatttgaaa 1980 cggacgtgaa agagatggga ttgcatctat tatgctttcc tatggaaact gctttttgtt 2040 ttttttaagt attacgtttt tttgtatttt tatataaata aagatgctgt aaaatggcaa 2100 aaaaaa 2106 138 461 PRT Danio rerio 138 Met Ala Ala Cys Thr Gly Ser Asp Phe Asp Phe Ala Phe Leu Glu Glu 1 5 10 15 Gly Phe Cys Ala Arg Asp Ile Val Glu Gln Lys Ile Asn Glu Ser Ser 20 25 30 Leu Ser Asp Asp Lys Asp Ala Phe Tyr Val Ala Asp Leu Gly Asp Val 35 40 45 Leu Lys Lys His Leu Arg Trp Leu Arg Val Leu Pro Arg Ile Thr Pro 50 55 60 Phe Tyr Ala Val Lys Cys Asn Asp Ser Arg Ala Val Val Thr Thr Leu 65 70 75 80 Ala Ser Leu Gly Ala Gly Phe Asp Cys Ala Ser Lys Thr Glu Ile Gln 85 90 95 Ile Val Gln Ser Val Gly Val Asp Pro Ser Arg Ile Ile Tyr Ala Asn 100 105 110 Pro Cys Lys Gln Val Ser Gln Ile Lys Tyr Ala Ser Ala His Gly Val 115 120 125 Gln Met Met Thr Phe Asp Ser Glu Val Glu Leu Met Lys Val Ala Arg 130 135 140 Ser His Glu Asn Ala Lys Leu Val Leu Arg Ile Ala Thr Asp Asp Ser 145 150 155 160 Lys Ala Val Cys Arg Leu Ser Val Lys Phe Gly Ala Thr Leu Lys Ser 165 170 175 Ser Arg Leu Leu Leu Glu Arg Ala Lys Glu Leu Gly Leu Asp Val Ile 180 185 190 Gly Val Ser Phe His Val Gly Ser Gly Cys Thr Asp Pro Glu Thr Tyr 195 200 205 Ser Gln Ala Ile Ser Asp Ala Arg Tyr Val Phe Asp Ile Gly Ala Glu 210 215 220 Leu Gly Tyr Asn Met Ser Leu Leu Asp Ile Gly Gly Gly Phe Pro Gly 225 230 235 240 Ser Glu Asp Thr Lys Leu Lys Phe Glu Glu Ile Ala Ala Val Ile Asn 245 250 255 Pro Ala Leu Asp Lys Tyr Phe Pro Val Asp Ser Gly Val Arg Ile Ile 260 265 270 Ala Glu Pro Gly Arg Tyr Tyr Val Ala Ser Ala Tyr Thr Leu Ala Val 275 280 285 Asn Ile Ile Ala Lys Lys Val Ile Met Lys Glu Gln Ser Ala Ser Asp 290 295 300 Glu Glu Glu Asp Gly Ser Asn Asp Arg Thr Leu Met Tyr Tyr Val Asn 305 310 315 320 Asp Gly Val Tyr Gly Ser Phe Asn Cys Ile Leu Tyr Asp His Ala His 325 330 335 Val Leu Pro Thr Leu His Lys Lys Pro Lys Pro Asp Glu Arg Met Tyr 340 345 350 Pro Cys Ser Ile Trp Gly Pro Thr Cys Asp Gly Leu Asp Arg Ile Val 355 360 365 Glu Gln Cys Ser Leu Pro Asp Met Gln Val Gly Asp Trp Leu Leu Phe 370 375 380 Glu Asn Met Gly Ala Tyr Thr Val Ala Ala Ser Ser Thr Phe Asn Gly 385 390 395 400 Phe Lys Lys Pro Ile Phe Tyr Tyr Ile Met Ser Arg Thr Ala Trp Gln 405 410 415 Cys Met Gln Gln Ile Arg Ala Gln Arg Ile Pro Ala Leu Pro Leu Glu 420 425 430 Glu Pro Ser Ala Gly Asn Val Pro Ser His Cys Gly Arg Glu Ser Ser 435 440 445 Leu Asp Val Pro Ala Lys Pro Cys Pro Thr Gln Val Leu 450 455 460 139 2043 DNA Danio rerio 139 ctgagtcggt ctcagtgaag atctggcgtc gatgattttt caactgtgta cgaagagagg 60 gccccaaaat ctcttacttt tttctactca aataacggac attcgcttac taccttttgt 120 gctccatatg gttcatggaa ggaaaatctc aatttcagcc accacaatcc cgcccattat 180 gccatttttc agccaataat gcttaagatc atgacagttt acgttctctt aaactgttat 240 aaagctgaaa tttgttgctg ttaagcacaa aatattaata gcaacccgag acgcagagga 300 gaggtattaa tataagcagc catggccgtg ggcccagtgg accccagaga ggtgttgaaa 360 ggagttgaag cactgctggg gaaggatgga gatctacgta gttttgaggg agtcgctaaa 420 gtcttcagtc ttatgaaggc ctcacataat atggtcatca gatgcatgta tctgaacata 480 tcacttgaga caaaatcaca tgatgtgcta attcgattca ttcgagttgg aggctacaag 540 ttgttgaact cttggctcac ctactcgaag accaccacca acacgcctct gttgcagctc 600 attctgctga ccctgcagaa gcttcctctt actgtggacc acctcaaaca gaataacact 660 gccaagctgg tgaaacatct gagcaagagc ggagagacag aagagctgag gaagttggca 720 tctgtgctgg tggatggctg gatggctatt attcgctctc agagcgtctc cagtggagca 780 tcgcctaatg acaagaaacg gaaaaaggag gatggaaaag tgcgtccaga ggtgaagcca 840 aaggaaaagg cagctgagga ggagaagaag aaggacaaac caaaagctta tgctccaagt 900 catgctaaga tccgctctac aggtttggag gtggagagtc cggctccagt tcctgtgaag 960 aagccacccg ttgcacccca acttggagac aaatacaaca tcaagccagc tgtacttaaa 1020 agaccaagca catctttgac tgacgcacca ccagtggaaa agaagtacaa gccacttaac 1080 accaccccca actcaaccaa agagatcaaa gtcaaactca tacccccaca gccgatggag 1140 gccactggat ttttggacgc cctcaactct gctccagtgc ctggcatcaa gatcaagaaa 1200 aagaaaccaa aagcagtttc acccaccagt aataagccct gtccatttga cagcaggccg 1260 cagtcatttt caggcgcaca ggccaaacca tcttcccctg aaacgccttc atctcaaacc 1320 cctcctcatg agaatcaaga actggagcag cctggcacac ccgtgcccac cgaggagcct 1380 gaagccatgg aaaccggtga taaacccaat gttctggatg aacctcgcag tgaggaggag 1440 attctgatta agaaaggaaa gaagaagaac agcgtgcggt gggcgcaaga ggatcatctc 1500 aaagagtact tctatttcga tctggatgaa actgagagag tgaatgtgaa caaaattaag 1560 gactttggtg aggcggctaa gcgggagctg atgatggata ggcagacgtt tgaaatggct 1620 cgtcgtcttt ctcatgacac catggaggag agggtgccct ggaccccacc gcgacccttg 1680 accctattag gctgcctggt cactgcaggt gccagcagcc gagagaagat gatccagaga 1740 gaccgagaga tgggcattct gcaggagatc ttcctctgca aggagagtgt tccagacagt 1800 cctcatgagc cagatcctga gccgtatgag cccatgcctc ctcgtctcat cccactggac 1860 gaggatagca gtatagtaga ggacatggat tccactgatc ctgcctctac tcccggctct 1920 aatgaaggct ccaagctgcc tcctgtattt gttaacctta tggggagcct gggcaacagc 1980 ggccgcatcc cgcaggcaca gggcaatgcc ccacagccaa aaaaaaaaaa aaaaaaaaaa 2040 aaa 2043 140 574 PRT Danio rerio 140 Met Ala Val Gly Pro Val Asp Pro Arg Glu Val Leu Lys Gly Val Glu 1 5 10 15 Ala Leu Leu Gly Lys Asp Gly Asp Leu Arg Ser Phe Glu Gly Val Ala 20 25 30 Lys Val Phe Ser Leu Met Lys Ala Ser His Asn Met Val Ile Arg Cys 35 40 45 Met Tyr Leu Asn Ile Ser Leu Glu Thr Lys Ser His Asp Val Leu Ile 50 55 60 Arg Phe Ile Arg Val Gly Gly Tyr Lys Leu Leu Asn Ser Trp Leu Thr 65 70 75 80 Tyr Ser Lys Thr Thr Thr Asn Thr Pro Leu Leu Gln Leu Ile Leu Leu 85 90 95 Thr Leu Gln Lys Leu Pro Leu Thr Val Asp His Leu Lys Gln Asn Asn 100 105 110 Thr Ala Lys Leu Val Lys His Leu Ser Lys Ser Gly Glu Thr Glu Glu 115 120 125 Leu Arg Lys Leu Ala Ser Val Leu Val Asp Gly Trp Met Ala Ile Ile 130 135 140 Arg Ser Gln Ser Val Ser Ser Gly Ala Ser Pro Asn Asp Lys Lys Arg 145 150 155 160 Lys Lys Glu Asp Gly Lys Val Arg Pro Glu Val Lys Pro Lys Glu Lys 165 170 175 Ala Ala Glu Glu Glu Lys Lys Lys Asp Lys Pro Lys Ala Tyr Ala Pro 180 185 190 Ser His Ala Lys Ile Arg Ser Thr Gly Leu Glu Val Glu Ser Pro Ala 195 200 205 Pro Val Pro Val Lys Lys Pro Pro Val Ala Pro Gln Leu Gly Asp Lys 210 215 220 Tyr Asn Ile Lys Pro Ala Val Leu Lys Arg Pro Ser Thr Ser Leu Thr 225 230 235 240 Asp Ala Pro Pro Val Glu Lys Lys Tyr Lys Pro Leu Asn Thr Thr Pro 245 250 255 Asn Ser Thr Lys Glu Ile Lys Val Lys Leu Ile Pro Pro Gln Pro Met 260 265 270 Glu Ala Thr Gly Phe Leu Asp Ala Leu Asn Ser Ala Pro Val Pro Gly 275 280 285 Ile Lys Ile Lys Lys Lys Lys Pro Lys Ala Val Ser Pro Thr Ser Asn 290 295 300 Lys Pro Cys Pro Phe Asp Ser Arg Pro Gln Ser Phe Ser Gly Ala Gln 305 310 315 320 Ala Lys Pro Ser Ser Pro Glu Thr Pro Ser Ser Gln Thr Pro Pro His 325 330 335 Glu Asn Gln Glu Leu Glu Gln Pro Gly Thr Pro Val Pro Thr Glu Glu 340 345 350 Pro Glu Ala Met Glu Thr Gly Asp Lys Pro Asn Val Leu Asp Glu Pro 355 360 365 Arg Ser Glu Glu Glu Ile Leu Ile Lys Lys Gly Lys Lys Lys Asn Ser 370 375 380 Val Arg Trp Ala Gln Glu Asp His Leu Lys Glu Tyr Phe Tyr Phe Asp 385 390 395 400 Leu Asp Glu Thr Glu Arg Val Asn Val Asn Lys Ile Lys Asp Phe Gly 405 410 415 Glu Ala Ala Lys Arg Glu Leu Met Met Asp Arg Gln Thr Phe Glu Met 420 425 430 Ala Arg Arg Leu Ser His Asp Thr Met Glu Glu Arg Val Pro Trp Thr 435 440 445 Pro Pro Arg Pro Leu Thr Leu Leu Gly Cys Leu Val Thr Ala Gly Ala 450 455 460 Ser Ser Arg Glu Lys Met Ile Gln Arg Asp Arg Glu Met Gly Ile Leu 465 470 475 480 Gln Glu Ile Phe Leu Cys Lys Glu Ser Val Pro Asp Ser Pro His Glu 485 490 495 Pro Asp Pro Glu Pro Tyr Glu Pro Met Pro Pro Arg Leu Ile Pro Leu 500 505 510 Asp Glu Asp Ser Ser Ile Val Glu Asp Met Asp Ser Thr Asp Pro Ala 515 520 525 Ser Thr Pro Gly Ser Asn Glu Gly Ser Lys Leu Pro Pro Val Phe Val 530 535 540 Asn Leu Met Gly Ser Leu Gly Asn Ser Gly Arg Ile Pro Gln Ala Gln 545 550 555 560 Gly Asn Ala Pro Gln Pro Lys Lys Lys Lys Lys Lys Lys Lys 565 570 141 1254 DNA Danio rerio 141 ggcagggtag catgtggagg tcagagatgt agaaacgctt cgtcagagaa taaaaacttt 60 aattattaga catcagcgtt tgataactcg acatagtgtc atctatgcgg atttacatca 120 gatattattg aacacaaggc tgtcggtaaa gagattttgt gtggatagtc tccatcagtt 180 tgacgcgggt cagtaaggaa gaattgtaat cagcgaagcg gtattaactg tataagtgaa 240 tctccgaaaa catggataac aacacaccgc caccgggagg atttaaagga ggactcggga 300 gcatttttgg tggaggaacc cctgagtatt caaacactga gctttcagga gtaccgttga 360 ctggaatgag tcctctttcc ccatacctca atgttgaccc tcgttacctc atacaggaca 420 ctgatgagtt catcttgcca actggggcaa ataaaactcg tggtcgcttt gagctggcct 480 tcttcaccat tggtggatgt tgtataacag gagctgcttt tgggacactc aatggtctcc 540 gtatgggcct gtcagaaact agagacatgc catggtcaaa acccagaaat gtacagattt 600 tgaacatggt cacacgacaa ggggcatcgt gggccaacac actaggatca gtagctcttt 660 tgtatagcgt gtttggtgtg gccattgaga gggccagggg tgcagaggac gacctcaaca 720 cagtggcagc tggaacatta accgggatgg tatttaaatc cacaggtgga ctcaaaggag 780 ttgccagagg aggtctcatt ggtttagcca tgtcagggct gtatgctctg tacaacaact 840 gggatcacct aaaggggaaa tctccttcac attactgaga atgactgaga actcaaacca 900 tggacttatt ttgactctga gccaattctt ggtatttatt gccccccttt ggagatccat 960 tcatttctac aagatacaaa aggaccccca catgtgcaca gattcttcaa agttaggtta 1020 agtaaactat ttaaatggaa ataccaaact gcctccaaat atgtggacta tgttgcaatg 1080 agttagctgg taacattgtg ttcgttcatg tatgtttttg ttgatgttaa gttgcatctg 1140 gaggaacttt atgctttgat tacgaggttt tatctgttct cattgtggat ttatgcgttt 1200 tttcagtgct gattaaactg ggctcttcct ctttagaaaa aaaaaaaaaa aaaa 1254 142 208 PRT Danio rerio 142 Met Asp Asn Asn Thr Pro Pro Pro Gly Gly Phe Lys Gly Gly Leu Gly 1 5 10 15 Ser Ile Phe Gly Gly Gly Thr Pro Glu Tyr Ser Asn Thr Glu Leu Ser 20 25 30 Gly Val Pro Leu Thr Gly Met Ser Pro Leu Ser Pro Tyr Leu Asn Val 35 40 45 Asp Pro Arg Tyr Leu Ile Gln Asp Thr Asp Glu Phe Ile Leu Pro Thr 50 55 60 Gly Ala Asn Lys Thr Arg Gly Arg Phe Glu Leu Ala Phe Phe Thr Ile 65 70 75 80 Gly Gly Cys Cys Ile Thr Gly Ala Ala Phe Gly Thr Leu Asn Gly Leu 85 90 95 Arg Met Gly Leu Ser Glu Thr Arg Asp Met Pro Trp Ser Lys Pro Arg 100 105 110 Asn Val Gln Ile Leu Asn Met Val Thr Arg Gln Gly Ala Ser Trp Ala 115 120 125 Asn Thr Leu Gly Ser Val Ala Leu Leu Tyr Ser Val Phe Gly Val Ala 130 135 140 Ile Glu Arg Ala Arg Gly Ala Glu Asp Asp Leu Asn Thr Val Ala Ala 145 150 155 160 Gly Thr Leu Thr Gly Met Val Phe Lys Ser Thr Gly Gly Leu Lys Gly 165 170 175 Val Ala Arg Gly Gly Leu Ile Gly Leu Ala Met Ser Gly Leu Tyr Ala 180 185 190 Leu Tyr Asn Asn Trp Asp His Leu Lys Gly Lys Ser Pro Ser His Tyr 195 200 205 143 2850 DNA Danio rerio 143 cgccgagacg cgcgcacacg tgtctcacta gcgtctgcgg tgttttcatc gcacatttct 60 ttaatttcct cgattatcta ccatccgtga gagcttcaga aatggcccag agaaagaaaa 120 agctgatgag gaggaagaaa agcggcgcaa accgcgatca cgagctccag tccgacgatg 180 aagagtttga ggtggccgca ggagtcaaag atgacgaaaa taccacggtc aggaggcttc 240 ctcggtttcc ggcctcgtct gagtgcgtgt ctgacgtgga gctggacacc agagagctgg 300 tcagagctca gaacaagaag aagaagaaat ctggaggctt tcagtctatg ggtctcagct 360 atccggtgta taaaggcatc atgaagaagg gctataaagt gcccacaccc atccagagga 420 agacgatccc ggtgattctg gacggtaagg atgtggtggc gatggcccgc accggcagcg 480 gtaagactgc agcgttcctc gttcccctgt ttgagaagct gaaggctccg caggcacaga 540 cgggggctcg ggcactgatc ctcacaccca cccgagagct cgccctgcag accatgaagt 600 tcaccaaaga gctggggaaa ttcactggtc tgagaacagc gctcattctc ggtggagaca 660 gcatggatga tcagttcgct gctcttcacg agaacccaga catcatcatc ggtactcctg 720 gtcgtctgat gcacgtcatc caggagatga acctgaagct gcagtcggtg gagtatgtgg 780 tgttcgacga ggccgacaga ctgtttgaga tgggctttgc ggagcagctt caggagatca 840 tcagacgtct tccagacgcc cggcagacgc tgctgttctc cgctacgctt cccaaactca 900 tcgtggagtt cgccagagcc ggtctgacgg agccggtcct cattcgtctg gatgtggaca 960 ctaaactgag tgaacagctg aagctgtcgt tcttctctct gcgtctggat gataagccgg 1020 ctcttctgct gcacctgctg aggaacgtgg tgaagccgca ggagcagacg gtggtgttcg 1080 tggccactaa acatcatgtg gagtatctga aggagctgct gtcagcggag ggtgtggact 1140 gctcctgcat ctacagtgct ctggatcaga cggctcgtaa gatcagcatc ggtcgctttg 1200 ttcatcgtaa ggtgatgctg ctgctggtga cggatgtggc ggcgcgcggt atcgacatcc 1260 ccctgctgga caacgtcatc aactacaact tcccctgcaa acccaagctc ttcctgcaca 1320 gagtcggtcg tgtggctcgc gctggtcgag ggggaacagc atacagtctg gtgtgtcctg 1380 atgaggtgcc ttacctgtat gacctgcacc tgtttctggg tcgacccatg cagctcgcac 1440 accccgaaca cacacaagaa gcggacggtg tgttcggccg tgttcctcag agtgttctgg 1500 atgatgagga gtgtcagctg atcacggcgc atcagaactc tctggacctg cagaatctca 1560 gacgcgtgtc agaaaacgcc tacaagcagt acctgaagtc cagacccgtc ccgtctgcag 1620 agtccatcaa gcgcagccgc aacacacagc tcacagacat ggcggtacac ccgctgctgg 1680 gctgtgggct ggagaagatg gaactggatc gactgctgat ggtcgacacc atcaagggct 1740 acaaggccaa atctactatt tttgagatca actccagcaa taagacgagc gctagcgagg 1800 tgatgcgcgc caaacgctcc cgggaccgcc agcgggtgga gaagttcagc agagcaagag 1860 cagagctgag ggcggagcca ggggcgggac tgagggcgga gccaccagtg cacagagaga 1920 cacaggagga agacgaggag gagcaggagg agctgtcgac ggtgttctct gaggtggttg 1980 gaggtaagag acggaggccg gatgcggagc cgcacacaca caagagcaag aagagcagaa 2040 cagcgggccg agaccaggag ttctacatcc catacagacc caaagacttc aactcagagc 2100 gcgggctcag tctggactct ggtgcgggct cgtttgagca gcaggcctcc tctgcagttc 2160 tggatctgat tggagatgag agcaacacat taaaccagca caagagcctg atgaagtggg 2220 accgcaagaa gaagcgcttc gtccgtgaca ccggcaaaga ggacaaacac aagaagatca 2280 ggacggagag cggacaactg atcagaggac acaagaagaa gaagagcttt tatgaagagt 2340 ggaagaagaa gtataaggtg gagggcggag cttcagactc tgatggagaa gggggaggag 2400 ctcagagggg gcggggctct gcaggtcgtc ggggtcgtgg tcggggtcgc tctgcagcag 2460 gttcccaggc tacgccagcg cagcatcagg gcggttcagg tgttcgctcg gagctgaaga 2520 accggcagca gatcctgaag cagcgcaaga ggaaagcaaa gcagcagttc ctgcagagcg 2580 gaggcatgaa gaagctcaga aacaagggaa aacagcgcct ccgagaggtc atgaagtccg 2640 gcttcggccg cggagcagtc aagaagggca agatgaggaa gaaactttag gagaatgcat 2700 cgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgcgtgtgtg tgtgtgtgtg tgtgtgtgtg 2760 tgtgtgtgtg tgtgcgtgtg tgtgtatgtt tataagaata atatggagat cagtgagtgt 2820 gtgtcatctc tgcgtctgca ttaaaggtgt 2850 144 862 PRT Danio rerio 144 Met Ala Gln Arg Lys Lys Lys Leu Met Arg Arg Lys Lys Ser Gly Ala 1 5 10 15 Asn Arg Asp His Glu Leu Gln Ser Asp Asp Glu Glu Phe Glu Val Ala 20 25 30 Ala Gly Val Lys Asp Asp Glu Asn Thr Thr Val Arg Arg Leu Pro Arg 35 40 45 Phe Pro Ala Ser Ser Glu Cys Val Ser Asp Val Glu Leu Asp Thr Arg 50 55 60 Glu Leu Val Arg Ala Gln Asn Lys Lys Lys Lys Lys Ser Gly Gly Phe 65 70 75 80 Gln Ser Met Gly Leu Ser Tyr Pro Val Tyr Lys Gly Ile Met Lys Lys 85 90 95 Gly Tyr Lys Val Pro Thr Pro Ile Gln Arg Lys Thr Ile Pro Val Ile 100 105 110 Leu Asp Gly Lys Asp Val Val Ala Met Ala Arg Thr Gly Ser Gly Lys 115 120 125 Thr Ala Ala Phe Leu Val Pro Leu Phe Glu Lys Leu Lys Ala Pro Gln 130 135 140 Ala Gln Thr Gly Ala Arg Ala Leu Ile Leu Thr Pro Thr Arg Glu Leu 145 150 155 160 Ala Leu Gln Thr Met Lys Phe Thr Lys Glu Leu Gly Lys Phe Thr Gly 165 170 175 Leu Arg Thr Ala Leu Ile Leu Gly Gly Asp Ser Met Asp Asp Gln Phe 180 185 190 Ala Ala Leu His Glu Asn Pro Asp Ile Ile Ile Gly Thr Pro Gly Arg 195 200 205 Leu Met His Val Ile Gln Glu Met Asn Leu Lys Leu Gln Ser Val Glu 210 215 220 Tyr Val Val Phe Asp Glu Ala Asp Arg Leu Phe Glu Met Gly Phe Ala 225 230 235 240 Glu Gln Leu Gln Glu Ile Ile Arg Arg Leu Pro Asp Ala Arg Gln Thr 245 250 255 Leu Leu Phe Ser Ala Thr Leu Pro Lys Leu Ile Val Glu Phe Ala Arg 260 265 270 Ala Gly Leu Thr Glu Pro Val Leu Ile Arg Leu Asp Val Asp Thr Lys 275 280 285 Leu Ser Glu Gln Leu Lys Leu Ser Phe Phe Ser Leu Arg Leu Asp Asp 290 295 300 Lys Pro Ala Leu Leu Leu His Leu Leu Arg Asn Val Val Lys Pro Gln 305 310 315 320 Glu Gln Thr Val Val Phe Val Ala Thr Lys His His Val Glu Tyr Leu 325 330 335 Lys Glu Leu Leu Ser Ala Glu Gly Val Asp Cys Ser Cys Ile Tyr Ser 340 345 350 Ala Leu Asp Gln Thr Ala Arg Lys Ile Ser Ile Gly Arg Phe Val His 355 360 365 Arg Lys Val Met Leu Leu Leu Val Thr Asp Val Ala Ala Arg Gly Ile 370 375 380 Asp Ile Pro Leu Leu Asp Asn Val Ile Asn Tyr Asn Phe Pro Cys Lys 385 390 395 400 Pro Lys Leu Phe Leu His Arg Val Gly Arg Val Ala Arg Ala Gly Arg 405 410 415 Gly Gly Thr Ala Tyr Ser Leu Val Cys Pro Asp Glu Val Pro Tyr Leu 420 425 430 Tyr Asp Leu His Leu Phe Leu Gly Arg Pro Met Gln Leu Ala His Pro 435 440 445 Glu His Thr Gln Glu Ala Asp Gly Val Phe Gly Arg Val Pro Gln Ser 450 455 460 Val Leu Asp Asp Glu Glu Cys Gln Leu Ile Thr Ala His Gln Asn Ser 465 470 475 480 Leu Asp Leu Gln Asn Leu Arg Arg Val Ser Glu Asn Ala Tyr Lys Gln 485 490 495 Tyr Leu Lys Ser Arg Pro Val Pro Ser Ala Glu Ser Ile Lys Arg Ser 500 505 510 Arg Asn Thr Gln Leu Thr Asp Met Ala Val His Pro Leu Leu Gly Cys 515 520 525 Gly Leu Glu Lys Met Glu Leu Asp Arg Leu Leu Met Val Asp Thr Ile 530 535 540 Lys Gly Tyr Lys Ala Lys Ser Thr Ile Phe Glu Ile Asn Ser Ser Asn 545 550 555 560 Lys Thr Ser Ala Ser Glu Val Met Arg Ala Lys Arg Ser Arg Asp Arg 565 570 575 Gln Arg Val Glu Lys Phe Ser Arg Ala Arg Ala Glu Leu Arg Ala Glu 580 585 590 Pro Gly Ala Gly Leu Arg Ala Glu Pro Pro Val His Arg Glu Thr Gln 595 600 605 Glu Glu Asp Glu Glu Glu Gln Glu Glu Leu Ser Thr Val Phe Ser Glu 610 615 620 Val Val Gly Gly Lys Arg Arg Arg Pro Asp Ala Glu Pro His Thr His 625 630 635 640 Lys Ser Lys Lys Ser Arg Thr Ala Gly Arg Asp Gln Glu Phe Tyr Ile 645 650 655 Pro Tyr Arg Pro Lys Asp Phe Asn Ser Glu Arg Gly Leu Ser Leu Asp 660 665 670 Ser Gly Ala Gly Ser Phe Glu Gln Gln Ala Ser Ser Ala Val Leu Asp 675 680 685 Leu Ile Gly Asp Glu Ser Asn Thr Leu Asn Gln His Lys Ser Leu Met 690 695 700 Lys Trp Asp Arg Lys Lys Lys Arg Phe Val Arg Asp Thr Gly Lys Glu 705 710 715 720 Asp Lys His Lys Lys Ile Arg Thr Glu Ser Gly Gln Leu Ile Arg Gly 725 730 735 His Lys Lys Lys Lys Ser Phe Tyr Glu Glu Trp Lys Lys Lys Tyr Lys 740 745 750 Val Glu Gly Gly Ala Ser Asp Ser Asp Gly Glu Gly Gly Gly Ala Gln 755 760 765 Arg Gly Arg Gly Ser Ala Gly Arg Arg Gly Arg Gly Arg Gly Arg Ser 770 775 780 Ala Ala Gly Ser Gln Ala Thr Pro Ala Gln His Gln Gly Gly Ser Gly 785 790 795 800 Val Arg Ser Glu Leu Lys Asn Arg Gln Gln Ile Leu Lys Gln Arg Lys 805 810 815 Arg Lys Ala Lys Gln Gln Phe Leu Gln Ser Gly Gly Met Lys Lys Leu 820 825 830 Arg Asn Lys Gly Lys Gln Arg Leu Arg Glu Val Met Lys Ser Gly Phe 835 840 845 Gly Arg Gly Ala Val Lys Lys Gly Lys Met Arg Lys Lys Leu 850 855 860 145 2407 DNA Danio rerio 145 ccacgcgtcc gcccacgcgt ccgcccacgc gtccggctcc gtgtggacag tcgtgcgctg 60 gcgggagttt gccggattaa agccgtttaa aacgaagaaa tcgcaaagct ttcctgaaac 120 tcggccttca ctgaaacaca atgggcgaca gtgatgacga gtacgaccgc agacgcagag 180 acaagttccg ccgggagagg agcgactatg accggtccag ggagagagag gaccgccgta 240 gggatgactg gaatgaccgg cggccctcgg ctcgggaatg ggaccggggc cgagagcgtc 300 gcagcagagg ggagtaccgg gactacgaca ggggccgacg agagagattc tcccctccac 360 gacatgacat gagcccccaa cagaagcgca tgaggagaga ctgggatgac catggaggtg 420 atccgtacca tggtggctat gatctgggtt atggtggcgg tggaggaccc agctacgccc 480 ctcctcagcc gtggggtcac cctgacatgc acctgatgca gccccatcat ggaatcccca 540 tccaggccag gctgggaaac atccacgata tggatcttgg tcctcctcct cctgtgatga 600 agacctttaa ggagtttctg atttctctgg atgactctgt ggacgagacg gagtcggtca 660 aacgctacaa tgagtacaag attgatttta ggcgtcagca gatgcaggac ttcttcctgg 720 cccataaaga tgaagagtgg tttcgctcaa agtaccaccc ggatgaggct gggaggcgta 780 aagttgaggc ccacagtgct ctgcagaacc ggctgggcgt ctacatgtac ctgatggaca 840 acaactggtt tgagtcggtc tctctggata tcgagcgtgc tccgcaaatc accaagattt 900 tagatgcagc cgtgataaag atggagggag gagcagaaaa tgacctgcgc atcctggagc 960 agcccagcga ggaagaagag gagcgtgaga gactgtcatc tggtggaact ccctctgaac 1020 cctctaaacg ggacgaaccc aagcctgctg atactgagaa caaaccctca gaggaaaaag 1080 acaagatgga agagggtgag gagagcgctg agaaagacgg agaaaaggct tctgcaggag 1140 gaagtgaagg agaggagaaa actgagaagg aagcacctgc tgaaccaatc cctgaaccca 1200 agaagctgag taagaagagg aagaggaagc acagtggaga cagcgaggat gaagccagtg 1260 catctgagag cgaatctgat tctgactccg actcaaactc tcactgctcg gagaaaccct 1320 cggagagaga gagagaacca gaagaagtgg aggaaaagga agaagaggaa gaagaggagg 1380 gtgaagcggc tgatggaaag gagcagacgg atgagcagac ggaaagagag aaggaaaaag 1440 agaagagagt caaggatgat cagcctccac gtcctcgacc tcttcacagg acgtgttctc 1500 tgttcatgag agagtatcgc tccaacaata tctaaagcag aaggtgagtg agtggcagaa 1560 gacttttgaa gagaaaatgg gacccttgtt cagtgtgaaa gaaaacctgt ctgaggaaga 1620 agcggagaaa atgggccgca aagacccaga acaggaagtg gaaaagtttg tggtggccaa 1680 cacgcaggaa atgggcaaag acaagtggat ctgccctgta agtggcaaaa agtttaaggg 1740 cccagagttt gtgaggaagc acatcctgaa caaacacggg gacaagattg aggaagtaaa 1800 gaaagaggtg gtgtttttca acaacttcct aatggatgct aaaagaccat gcatcccgga 1860 gatgaaaccc ccaccccatc caggcccagg acagggagtg ctgtcacccg gtggtcttcc 1920 cttcccaccg cagggtcctc aaggtctcat gggttttggt cagcccagac ctccacttat 1980 gggttatgga ggtggtcctc cgtatccccc taaccagtat ggtggaggca gaggcaacta 2040 tgataacttc cgtggacagg gcggatatcc tggaaaaccc agaaacagcc gagtgatgcg 2100 tggagatcct cgcaatatca tcgagtatcg tgatttagac gcacctgatg atgtggattt 2160 cttttagagc tccagtttta gtctctacat tttaacctgt ttttttttta cggttgtcac 2220 agcattttct cgttttttgc attcattcca gaagctgccc gccgccgtct cctttcacat 2280 gtattacagt agcaaagctt cactttgtgt tcaatgtcat tatgtacttc atgtcataat 2340 gtgagcgtgt gctttatggc ctttttttcc tccaataaaa ctttaatttg gatgcaaaaa 2400 aaaaaaa 2407 146 673 PRT Danio rerio 146 Met Gly Asp Ser Asp Asp Glu Tyr Asp Arg Arg Arg Arg Asp Lys Phe 1 5 10 15 Arg Arg Glu Arg Ser Asp Tyr Asp Arg Ser Arg Glu Arg Glu Asp Arg 20 25 30 Arg Arg Asp Asp Trp Asn Asp Arg Arg Pro Ser Ala Arg Glu Trp Asp 35 40 45 Arg Gly Arg Glu Arg Arg Ser Arg Gly Glu Tyr Arg Asp Tyr Asp Arg 50 55 60 Gly Arg Arg Glu Arg Phe Ser Pro Pro Arg His Asp Met Ser Pro Gln 65 70 75 80 Gln Lys Arg Met Arg Arg Asp Trp Asp Asp His Gly Gly Asp Pro Tyr 85 90 95 His Gly Gly Tyr Asp Leu Gly Tyr Gly Gly Gly Gly Gly Pro Ser Tyr 100 105 110 Ala Pro Pro Gln Pro Trp Gly His Pro Asp Met His Leu Met Gln Pro 115 120 125 His His Gly Ile Pro Ile Gln Ala Arg Leu Gly Asn Ile His Asp Met 130 135 140 Asp Leu Gly Pro Pro Pro Pro Val Met Lys Thr Phe Lys Glu Phe Leu 145 150 155 160 Ile Ser Leu Asp Asp Ser Val Asp Glu Thr Glu Ser Val Lys Arg Tyr 165 170 175 Asn Glu Tyr Lys Ile Asp Phe Arg Arg Gln Gln Met Gln Asp Phe Phe 180 185 190 Leu Ala His Lys Asp Glu Glu Trp Phe Arg Ser Lys Tyr His Pro Asp 195 200 205 Glu Ala Gly Arg Arg Lys Val Glu Ala His Ser Ala Leu Gln Asn Arg 210 215 220 Leu Gly Val Tyr Met Tyr Leu Met Asp Asn Asn Trp Phe Glu Ser Val 225 230 235 240 Ser Leu Asp Ile Glu Arg Ala Pro Gln Ile Thr Lys Ile Leu Asp Ala 245 250 255 Ala Val Ile Lys Met Glu Gly Gly Ala Glu Asn Asp Leu Arg Ile Leu 260 265 270 Glu Gln Pro Ser Glu Glu Glu Glu Glu Arg Glu Arg Leu Ser Ser Gly 275 280 285 Gly Thr Pro Ser Glu Pro Ser Lys Arg Asp Glu Pro Lys Pro Ala Asp 290 295 300 Thr Glu Asn Lys Pro Ser Glu Glu Lys Asp Lys Met Glu Glu Gly Glu 305 310 315 320 Glu Ser Ala Glu Lys Asp Gly Glu Lys Ala Ser Ala Gly Gly Ser Glu 325 330 335 Gly Glu Glu Lys Thr Glu Lys Glu Ala Pro Ala Glu Pro Ile Pro Glu 340 345 350 Pro Lys Lys Leu Ser Lys Lys Arg Lys Arg Lys His Ser Gly Asp Ser 355 360 365 Glu Asp Glu Ala Ser Ala Ser Glu Ser Glu Ser Asp Ser Asp Ser Asp 370 375 380 Ser Asn Ser His Cys Ser Glu Lys Pro Ser Glu Arg Glu Arg Glu Pro 385 390 395 400 Glu Glu Val Glu Glu Lys Glu Glu Glu Glu Glu Glu Glu Gly Glu Ala 405 410 415 Ala Asp Gly Lys Glu Gln Thr Asp Glu Gln Thr Glu Arg Glu Lys Glu 420 425 430 Lys Glu Lys Arg Val Lys Asp Asp Gln Pro Pro Arg Pro Arg Pro Leu 435 440 445 His Arg Thr Cys Ser Leu Phe Met Arg Glu Tyr Arg Ser Asn Asn Ile 450 455 460 Gln Lys Val Ser Glu Trp Gln Lys Thr Phe Glu Glu Lys Met Gly Pro 465 470 475 480 Leu Phe Ser Val Lys Glu Asn Leu Ser Glu Glu Glu Ala Glu Lys Met 485 490 495 Gly Arg Lys Asp Pro Glu Gln Glu Val Glu Lys Phe Val Val Ala Asn 500 505 510 Thr Gln Glu Met Gly Lys Asp Lys Trp Ile Cys Pro Val Ser Gly Lys 515 520 525 Lys Phe Lys Gly Pro Glu Phe Val Arg Lys His Ile Leu Asn Lys His 530 535 540 Gly Asp Lys Ile Glu Glu Val Lys Lys Glu Val Val Phe Phe Asn Asn 545 550 555 560 Phe Leu Met Asp Ala Lys Arg Pro Cys Ile Pro Glu Met Lys Pro Pro 565 570 575 Pro His Pro Gly Pro Gly Gln Gly Val Leu Ser Pro Gly Gly Leu Pro 580 585 590 Phe Pro Pro Gln Gly Pro Gln Gly Leu Met Gly Phe Gly Gln Pro Arg 595 600 605 Pro Pro Leu Met Gly Tyr Gly Gly Gly Pro Pro Tyr Pro Pro Asn Gln 610 615 620 Tyr Gly Gly Gly Arg Gly Asn Tyr Asp Asn Phe Arg Gly Gln Gly Gly 625 630 635 640 Tyr Pro Gly Lys Pro Arg Asn Ser Arg Val Met Arg Gly Asp Pro Arg 645 650 655 Asn Ile Ile Glu Tyr Arg Asp Leu Asp Ala Pro Asp Asp Val Asp Phe 660 665 670 Phe 147 2498 DNA Danio rerio 147 gggcactgtt ggcctactgg cttttctctc tcacccgggc ggcggagcgc accggtagcg 60 ctaattatcc gtttacaact cactgggact ccagacagag accgtgattt caggggaccg 120 tcatcatggc gattaagttc ttggaggtaa taaaaccgtt ttgtgcggtt ttgcctgaaa 180 tccagaaacc agaacgaaag atccagttca gagaaaaggt gctatggact gccatcacat 240 tgttcatctt tcttgtttgc tgtcagatcc cactgtttgg gatcatgtct tcagactcag 300 cagatccatt ctactggatg agagtcatct tggcttccaa cagaggtact ttgatggagt 360 tgggtatctc tcccattgtg acctctggtc tgatcatgca gctgctggct ggagctaaaa 420 tcattgaggt cggagacaca ccgaaagaca gagcgctctt taatggagcc caaaaattgt 480 ttggtatgat catcaccatt ggccaggcta ttgtgtacgt catgactgga atgtatggag 540 acccttcaga gatgggtgct ggcatctgtc tgctgatcat cattcagttg tttgtggctg 600 gtctgatcgt gttgctgctg gatgagttgc tgcagaaagg ttatggtttg ggctctggta 660 tctctctctt cattgccact aacatctgtg agactatcgt atggaaagca ttcagcccaa 720 ccacagtcaa cacaggcaga ggtactgagt ttgagggagc cattattgct ctgttccacc 780 ttttggccac tcgcactgat aaagtcagag ctctgagaga ggctttctac agacagaacc 840 tgcctaacct catgaacctc atcgctacag tcttcgtctt tgctgtggtc atatacttcc 900 agggcttcag agttgatctg cccatcaagt ctgcacgtta ccgtggccaa tacaacacat 960 atcccatcaa gctgttctat acctccaaca ttcccatcat cctgcagtct gctctggttt 1020 ccaatcttta cgtcatctct cagatgctct ccactcaatt cagcggcaac ttcctagtca 1080 accttctggg gacttggtct gacacctcat ccggtggtcc agcgcgcgcc tatcctgtgg 1140 gcggtctctg ttactatctg tctcctccgg agtcgtttgg ttctgtgctt gatgacccag 1200 ttcatgcagt gatctacatt gtgttcatgc tgggatcctg tgccttcttt tcaaagacct 1260 ggattgaagt ttctggatct tctgccaaag atgtagctaa acagctgaaa gaacagcaca 1320 tggtgatgag gggacacact gggcgccatc ggctccggaa cgggaatcct attggctgtt 1380 accatcatct atcagtactt tgagatattt gttaaggaac agagtgaagt cggcagcatg 1440 ggagcgctac tcttctaaac aaggacactc acaaacacac acatacttgc tgacaccttt 1500 ttattatttt tatgattgtt gttgtcattt aattttttta tttcggtgca atttgagcat 1560 aattagtggt ctgtcacgag cgctgtcttg ggagaacgtt ctcttagccg ctagctcagg 1620 ccgagtaagt tctcctgtat gtcagaagtg gcaaaacgtt tctaaaattc agtgatgaag 1680 ctgaacttca gtaatggaac agtaaagatg gactctgtga gctctgacta ggaatattta 1740 gatgaattaa attttctctt cacatcagtc tgtccttcat ctgtctccag ccatcaccga 1800 atccattcat ccactctttt ctttagttcc tgcctggtcc tagttgataa agattttcca 1860 gagagtattg cttatattgg gtgactgatt tactcgacac tgccaatttt gacaactgct 1920 tcgtgtttgt cagtcatatc ggtcttgtat gtaaagctgg caggcagttt gcttttctaa 1980 ctcattgaat catgctaagt cataaaaact gtgatttagt tatggcttca aatataagag 2040 ctaaaccaca ggttcttgtt ttccagtcag caattgtcaa ttttaaacaa ggcagcatag 2100 gagtaaaacc acgcacaatt tggccacaca tgtgtgtgtg gtggtttttc tgtgtttgtt 2160 gaacaagtat ttcaaaacgc tgggatcttt tttttttttt ttttttttgt acccaacaca 2220 tcttacacaa tagagagcaa ggattttctg tatttgaaag tacctggtct ttagtgagag 2280 cttcagtgac agatggaagt ggtcactttc ccaaattaag ccctttttac tgtatcattg 2340 ggtgtctgta tatatgacat ttaatgctgt gcgcgaccat gtaaaataag cccttctttt 2400 tctttccttt tattttgggc tgtaaaaaaa aatgttccga acaaatatat tcagcaaaga 2460 ctgatcattt tcaataaaaa ctcctttttt gggaaaaa 2498 148 425 PRT Danio rerio 148 Met Ala Ile Lys Phe Leu Glu Val Ile Lys Pro Phe Cys Ala Val Leu 1 5 10 15 Pro Glu Ile Gln Lys Pro Glu Arg Lys Ile Gln Phe Arg Glu Lys Val 20 25 30 Leu Trp Thr Ala Ile Thr Leu Phe Ile Phe Leu Val Cys Cys Gln Ile 35 40 45 Pro Leu Phe Gly Ile Met Ser Ser Asp Ser Ala Asp Pro Phe Tyr Trp 50 55 60 Met Arg Val Ile Leu Ala Ser Asn Arg Gly Thr Leu Met Glu Leu Gly 65 70 75 80 Ile Ser Pro Ile Val Thr Ser Gly Leu Ile Met Gln Leu Leu Ala Gly 85 90 95 Ala Lys Ile Ile Glu Val Gly Asp Thr Pro Lys Asp Arg Ala Leu Phe 100 105 110 Asn Gly Ala Gln Lys Leu Phe Gly Met Ile Ile Thr Ile Gly Gln Ala 115 120 125 Ile Val Tyr Val Met Thr Gly Met Tyr Gly Asp Pro Ser Glu Met Gly 130 135 140 Ala Gly Ile Cys Leu Leu Ile Ile Ile Gln Leu Phe Val Ala Gly Leu 145 150 155 160 Ile Val Leu Leu Leu Asp Glu Leu Leu Gln Lys Gly Tyr Gly Leu Gly 165 170 175 Ser Gly Ile Ser Leu Phe Ile Ala Thr Asn Ile Cys Glu Thr Ile Val 180 185 190 Trp Lys Ala Phe Ser Pro Thr Thr Val Asn Thr Gly Arg Gly Thr Glu 195 200 205 Phe Glu Gly Ala Ile Ile Ala Leu Phe His Leu Leu Ala Thr Arg Thr 210 215 220 Asp Lys Val Arg Ala Leu Arg Glu Ala Phe Tyr Arg Gln Asn Leu Pro 225 230 235 240 Asn Leu Met Asn Leu Ile Ala Thr Val Phe Val Phe Ala Val Val Ile 245 250 255 Tyr Phe Gln Gly Phe Arg Val Asp Leu Pro Ile Lys Ser Ala Arg Tyr 260 265 270 Arg Gly Gln Tyr Asn Thr Tyr Pro Ile Lys Leu Phe Tyr Thr Ser Asn 275 280 285 Ile Pro Ile Ile Leu Gln Ser Ala Leu Val Ser Asn Leu Tyr Val Ile 290 295 300 Ser Gln Met Leu Ser Thr Gln Phe Ser Gly Asn Phe Leu Val Asn Leu 305 310 315 320 Leu Gly Thr Trp Ser Asp Thr Ser Ser Gly Gly Pro Ala Arg Ala Tyr 325 330 335 Pro Val Gly Gly Leu Cys Tyr Tyr Leu Ser Pro Pro Glu Ser Phe Gly 340 345 350 Ser Val Leu Asp Asp Pro Val His Ala Val Ile Tyr Ile Val Phe Met 355 360 365 Leu Gly Ser Cys Ala Phe Phe Ser Lys Thr Trp Ile Glu Val Ser Gly 370 375 380 Ser Ser Ala Lys Asp Val Ala Lys Gln Leu Lys Glu Gln His Met Val 385 390 395 400 Met Arg Gly His Thr Gly Arg His Arg Leu Arg Asn Gly Asn Pro Ile 405 410 415 Gly Cys Tyr His His Leu Ser Val Leu 420 425 149 1820 DNA Danio rerio 149 cgtttccttt gttgtgatat cgaaaaaaaa cgtagtctta cttaaaatct cagcttctgg 60 attaagaaat caagcagttg tcgtctgtcg gtattgatca gccgtaaaaa tgagtggcgg 120 agtctacgga ggagatgaag ttggagctct tgtctttgat atgggctcgt attcagtgag 180 agctggatat gcaggagagg attgtgccaa ggctgatttt cctactgtga ttggtgtgac 240 tctggaccgg gaagatggca gtacaccgat ggagacagat ggggagaaag gaaagcaaag 300 cggaaccacc tacttcattg acaccaacca gctcagggtg cctagagaaa gcatggaggt 360 catgtcacct ctcaaaaatg ggatgattga ggactgggac agttttcaag ccattttaga 420 tcatacctac aaaatgcact tcaagtcaca gcccggtctg catccagtcc tgatgtcaga 480 ggcctcgtgg aacacacgag cgaaaagaga gaagctgacg gagctgatgt ttgagcatta 540 caacattccc gctttcttct tgtgtaaatc agctgtgctg tcagcgtttg ccaatggacg 600 atccacaggc ttagtgttag atagcggagc aacacatact actgctattc cagtgcatga 660 tggttatgtc ctacaacaag gcatcgtaaa gtctcctctt gctggtgact tcatgagtat 720 gcaatgtaga gagctgtttc aagagttagg tgttgaaata gtgcctcctt atatgattgc 780 atcaaaggat tcagttcgtg aaggcactcc tgctagttgg aagaaaaagg agaaactacc 840 tcaagttacc cgatcatggc ataactatat gtgtaatacc gtcatccagg attttcaggc 900 ctctgtgctc caggtgtcag attcacccta tgatgaacaa gtggctgccc agatgcccac 960 tgttcactat gagctgccta atggctacaa ctgtgacttt ggagcggaaa gacttaagat 1020 tcctgagggt ctttttgacc cttcaaatgc taagggtcta tcagggaaca ccatgttagg 1080 agtcggccat gttgtgacca ccagtgttgg catgtgtgac attgacattc gaccgggtct 1140 gtatggcagt gtggtagtta caggaggaaa taccctcatt caaggcttta cagacagact 1200 taacagagaa ctctcgcaga agacaccacc tagcatgcgt cttaagctaa tagccaataa 1260 cacaactgtg gagcgccgat ttagcgcctg gattggagga tccatccttg catccctggg 1320 aactttccaa caaatgtgga tttcaaaaca ggagtatgaa gaaggtggca aacagtgtgt 1380 tgacaggaag tgcccttaac acgttccttc aacagctggt tgccttcaga aaatgaaatc 1440 aaaataaata acacctccac ctagtagaaa aagccttgtt agtggactca ggaactaagg 1500 acacttgctg ttatttttag tgcaaaccaa cccatatttt aaacagttta acatggttgc 1560 actcttttct actctacaca atattttcat atccagtttg gcaacacaaa cttttccgac 1620 cacctaattc atttcagagt aacagtgaca aggttatggg gatatgcgaa acctctataa 1680 tatgtttagg acaattactg tttttttttt tttaatttac accactttgt cactttctag 1740 aaaatggatc acagaaaaca aatgtttgtt atgcttttaa caactggatc aactgattga 1800 ggaataaaat tgtttttgtt 1820 150 429 PRT Danio rerio 150 Met Ser Gly Gly Val Tyr Gly Gly Asp Glu Val Gly Ala Leu Val Phe 1 5 10 15 Asp Met Gly Ser Tyr Ser Val Arg Ala Gly Tyr Ala Gly Glu Asp Cys 20 25 30 Ala Lys Ala Asp Phe Pro Thr Val Ile Gly Val Thr Leu Asp Arg Glu 35 40 45 Asp Gly Ser Thr Pro Met Glu Thr Asp Gly Glu Lys Gly Lys Gln Ser 50 55 60 Gly Thr Thr Tyr Phe Ile Asp Thr Asn Gln Leu Arg Val Pro Arg Glu 65 70 75 80 Ser Met Glu Val Met Ser Pro Leu Lys Asn Gly Met Ile Glu Asp Trp 85 90 95 Asp Ser Phe Gln Ala Ile Leu Asp His Thr Tyr Lys Met His Phe Lys 100 105 110 Ser Gln Pro Gly Leu His Pro Val Leu Met Ser Glu Ala Ser Trp Asn 115 120 125 Thr Arg Ala Lys Arg Glu Lys Leu Thr Glu Leu Met Phe Glu His Tyr 130 135 140 Asn Ile Pro Ala Phe Phe Leu Cys Lys Ser Ala Val Leu Ser Ala Phe 145 150 155 160 Ala Asn Gly Arg Ser Thr Gly Leu Val Leu Asp Ser Gly Ala Thr His 165 170 175 Thr Thr Ala Ile Pro Val His Asp Gly Tyr Val Leu Gln Gln Gly Ile 180 185 190 Val Lys Ser Pro Leu Ala Gly Asp Phe Met Ser Met Gln Cys Arg Glu 195 200 205 Leu Phe Gln Glu Leu Gly Val Glu Ile Val Pro Pro Tyr Met Ile Ala 210 215 220 Ser Lys Asp Ser Val Arg Glu Gly Thr Pro Ala Ser Trp Lys Lys Lys 225 230 235 240 Glu Lys Leu Pro Gln Val Thr Arg Ser Trp His Asn Tyr Met Cys Asn 245 250 255 Thr Val Ile Gln Asp Phe Gln Ala Ser Val Leu Gln Val Ser Asp Ser 260 265 270 Pro Tyr Asp Glu Gln Val Ala Ala Gln Met Pro Thr Val His Tyr Glu 275 280 285 Leu Pro Asn Gly Tyr Asn Cys Asp Phe Gly Ala Glu Arg Leu Lys Ile 290 295 300 Pro Glu Gly Leu Phe Asp Pro Ser Asn Ala Lys Gly Leu Ser Gly Asn 305 310 315 320 Thr Met Leu Gly Val Gly His Val Val Thr Thr Ser Val Gly Met Cys 325 330 335 Asp Ile Asp Ile Arg Pro Gly Leu Tyr Gly Ser Val Val Val Thr Gly 340 345 350 Gly Asn Thr Leu Ile Gln Gly Phe Thr Asp Arg Leu Asn Arg Glu Leu 355 360 365 Ser Gln Lys Thr Pro Pro Ser Met Arg Leu Lys Leu Ile Ala Asn Asn 370 375 380 Thr Thr Val Glu Arg Arg Phe Ser Ala Trp Ile Gly Gly Ser Ile Leu 385 390 395 400 Ala Ser Leu Gly Thr Phe Gln Gln Met Trp Ile Ser Lys Gln Glu Tyr 405 410 415 Glu Glu Gly Gly Lys Gln Cys Val Asp Arg Lys Cys Pro 420 425 151 2213 DNA Danio rerio 151 gtgcgcagaa agcccggcag gcgcaggctg taattggagg gtaaaatggc gctgagttct 60 caaggaacaa agaagaaagt ttgctattat tatgacgggg atgtcgggaa ttactactat 120 ggtcagggcc atcccatgaa gccccacaga atccgcatga cccacaacct tctgctcaac 180 tatgggctgt ataggaaaat ggagatttat cgacctcata aagccaatgc ggaagaaatg 240 accaagtacc acagtgacga ctacatcaag ttcctccgct ccatccgccc agacaacatg 300 tccgagtaca gcaagcagat gcagagattt aacgtagggg aggattgtcc tgtctttgac 360 ggcttatttg agttctgtca gctctcaaca ggtggatctg ttgctggtgc tgtgaaactc 420 aacaaacagc agacagacat tgccattaac tgggcaggag gtctacatca tgctaagaaa 480 tcagaggcat ctggattctg ctatgtcaac gacattgtac tggccattct ggagctgctc 540 aagtaccatc aaagagtgct ttacatcgac attgacattc atcacggtga tggagttgag 600 gaagcctttt acacaacaga ccgtgtcatg actgtgtctt tccacaagta tggagagtat 660 ttcccgggta ctggagacct cagagatatt ggtgcaggca aggggaagta ttacgctgta 720 aattacccgc tcagggatgg tattgatgat gagtcctatg aagccatatt caaacctatc 780 atgtccaaag tgatggagat gtaccagccc agtgctgtgg tgctccagtg cggcgctgac 840 tctctgtccg gagatcgatt gggctgcttt aaccttacca ttaaaggtca tgccaagtgt 900 gtagagtaca tgaagagctt taacctgccg ctgctcatgc tgggaggtgg aggctacacc 960 attaaaaacg tggcccgctg ctggactttc gagacagctg tagcactgga cagcaccatt 1020 cctaatgagc tcccgtacaa tgattacttt gagtattttg gacctgactt caaacttcac 1080 atcagcccct tcaacatgac caatcaaaac acaaatgact acctggaaaa gatcaagcag 1140 cgtctgtttg agaatctgcg gatgttgcct catgctccag gcgttcagat gcaggccatc 1200 ccggaggatg cagttcagga agacagcgga gatgaagagg acgaccctga caaacgcatc 1260 tccattcgtg ctcacgataa gcggatagcc tgtgatgaag agttctcaga ctctgaggat 1320 gaaggccagg gtggacggag aaacgcagcc aattacaaga agccaaaacg agtgaagact 1380 gaagaagaga aggatggaga agagaagaaa gatgttaaag aagaggagaa agcttcagaa 1440 gagaaaatgg acacaaaggg gccaaaagaa gaattaaaaa cagtgtgagg tgggggagaa 1500 ggactgccag tcctcctgga tgcatgtact taagtcgtca aacaccctca tcttcaaaca 1560 tcaaacccac atttacacac acacacatac acatacatgc tagatgttag ctctagtcct 1620 gagctaagga ctgtaaatta ttttgtaaga ctcgttttac tatttgtaat ctaatgaagc 1680 ggtataaaag tttttacatt tacaagtgtg aaatgaccag atgttttttt gtttcgtgct 1740 ttggaaccat gtttgtttgc ctcccttatg ttactattta gcgacgatag cacctcagac 1800 acaggaccac aaatagcaca taactttact gtcccatcat gggatgcttt tttttctctt 1860 ccccctcccc ctcaaataat tttaaatcat ggatggagat acttattggc cgctcctact 1920 tgggttttct ctgcggatgc tgtttggtgt ccggtgtcct ttacctcgtt tttaagcgct 1980 aaatgtttgg tttttattct ggtgtctgtc tctcgatacc tacatctgtg cggtgtgtaa 2040 gtactccggt atggtcgggg catggcgaag gcttgctgta ttgctgttag attcaagatt 2100 tttctttctg ttgtgcttta ttctagcatg tcagtagtgc ccctgaccag acatttagcc 2160 gtttgattat tccagagttc agtggttgtg aaatgtcctt attataacaa tga 2213 152 480 PRT Danio rerio 152 Met Ala Leu Ser Ser Gln Gly Thr Lys Lys Lys Val Cys Tyr Tyr Tyr 1 5 10 15 Asp Gly Asp Val Gly Asn Tyr Tyr Tyr Gly Gln Gly His Pro Met Lys 20 25 30 Pro His Arg Ile Arg Met Thr His Asn Leu Leu Leu Asn Tyr Gly Leu 35 40 45 Tyr Arg Lys Met Glu Ile Tyr Arg Pro His Lys Ala Asn Ala Glu Glu 50 55 60 Met Thr Lys Tyr His Ser Asp Asp Tyr Ile Lys Phe Leu Arg Ser Ile 65 70 75 80 Arg Pro Asp Asn Met Ser Glu Tyr Ser Lys Gln Met Gln Arg Phe Asn 85 90 95 Val Gly Glu Asp Cys Pro Val Phe Asp Gly Leu Phe Glu Phe Cys Gln 100 105 110 Leu Ser Thr Gly Gly Ser Val Ala Gly Ala Val Lys Leu Asn Lys Gln 115 120 125 Gln Thr Asp Ile Ala Ile Asn Trp Ala Gly Gly Leu His His Ala Lys 130 135 140 Lys Ser Glu Ala Ser Gly Phe Cys Tyr Val Asn Asp Ile Val Leu Ala 145 150 155 160 Ile Leu Glu Leu Leu Lys Tyr His Gln Arg Val Leu Tyr Ile Asp Ile 165 170 175 Asp Ile His His Gly Asp Gly Val Glu Glu Ala Phe Tyr Thr Thr Asp 180 185 190 Arg Val Met Thr Val Ser Phe His Lys Tyr Gly Glu Tyr Phe Pro Gly 195 200 205 Thr Gly Asp Leu Arg Asp Ile Gly Ala Gly Lys Gly Lys Tyr Tyr Ala 210 215 220 Val Asn Tyr Pro Leu Arg Asp Gly Ile Asp Asp Glu Ser Tyr Glu Ala 225 230 235 240 Ile Phe Lys Pro Ile Met Ser Lys Val Met Glu Met Tyr Gln Pro Ser 245 250 255 Ala Val Val Leu Gln Cys Gly Ala Asp Ser Leu Ser Gly Asp Arg Leu 260 265 270 Gly Cys Phe Asn Leu Thr Ile Lys Gly His Ala Lys Cys Val Glu Tyr 275 280 285 Met Lys Ser Phe Asn Leu Pro Leu Leu Met Leu Gly Gly Gly Gly Tyr 290 295 300 Thr Ile Lys Asn Val Ala Arg Cys Trp Thr Phe Glu Thr Ala Val Ala 305 310 315 320 Leu Asp Ser Thr Ile Pro Asn Glu Leu Pro Tyr Asn Asp Tyr Phe Glu 325 330 335 Tyr Phe Gly Pro Asp Phe Lys Leu His Ile Ser Pro Phe Asn Met Thr 340 345 350 Asn Gln Asn Thr Asn Asp Tyr Leu Glu Lys Ile Lys Gln Arg Leu Phe 355 360 365 Glu Asn Leu Arg Met Leu Pro His Ala Pro Gly Val Gln Met Gln Ala 370 375 380 Ile Pro Glu Asp Ala Val Gln Glu Asp Ser Gly Asp Glu Glu Asp Asp 385 390 395 400 Pro Asp Lys Arg Ile Ser Ile Arg Ala His Asp Lys Arg Ile Ala Cys 405 410 415 Asp Glu Glu Phe Ser Asp Ser Glu Asp Glu Gly Gln Gly Gly Arg Arg 420 425 430 Asn Ala Ala Asn Tyr Lys Lys Pro Lys Arg Val Lys Thr Glu Glu Glu 435 440 445 Lys Asp Gly Glu Glu Lys Lys Asp Val Lys Glu Glu Glu Lys Ala Ser 450 455 460 Glu Glu Lys Met Asp Thr Lys Gly Pro Lys Glu Glu Leu Lys Thr Val 465 470 475 480 153 1243 DNA Danio rerio 153 gggattgacc tcttcaagcg acacctctcc aagaacatcc aacaatgagt gagaccgcca 60 tctccttcgc caaggacttc ttggccggtg gtattgccgc tgccatctct aaaaccgccg 120 tggcccccat tgagagagtc aaactgctgc ttcaggtgca acatgctagc aaacagatta 180 cagcagataa gcagtacaag ggcattatgg actgcgtggt gcgtatcccc aaggagcagg 240 gcttcctgtc gttctggaga ggaaacttgg ccaacgtcat cagatacttc cccacacagg 300 ccctcaactt tgctttcaag gacaagtaca agaaggtctt ccttgatggt gtggacaagc 360 gcacccagtt ttggaggtac ttcgctggta acctggcttc aggtggtgct gctggtgcca 420 catccctctg cttcgtgtat ccccttgact tcgcaagaac ccgtcttgct gccgatgtcg 480 gaaaagctgg agcagaaaga gagttcagtg ggctgggtaa ctgcttggta aagatctcca 540 aatctgatgg catcaagggt ctgtaccagg gcttcaacgt gtccgtgcag ggtatcatca 600 tttacagagc tgcctacttc ggcatttatg acacagccaa gggtatgctg cccgatccca 660 agaacaccca tattgttgtg agttggatga ttgctcagag tgtgactgct gttgctggtc 720 ttgcttccta ccccttcgac acagtgcgtc gtcgtatgat gatgcagtct ggacgtaaag 780 gagctgacat catgtacagt ggcacaattg actgctggag gaagatcgca cgtgatgagg 840 gtggcaaggc tttcttcaag ggagcctggt ccaacgttct cagaggcatg ggtggcgcct 900 ttgtgctggt cttgtatgat gagctgaaga aggtcattta aatgttttcc tgtccaccat 960 tcaatgtgag atgtttaact gtaacatatc ttgtacattt ggaaggactc tcaaattccg 1020 ccttatttat agggaccaac tagtatgtca gtactagttg tactggggga aatttgtttt 1080 gcggtttttc ttttgccccg ttcatctctc ctttttgttc ccctcaccgt actgtcattc 1140 ccagaaagaa agaacaaaag aatgccaggt actaaaatct acctgttttc cagccggtct 1200 gtttttagtc ttcatcaata aagacccact gaatgaacct aaa 1243 154 298 PRT Danio rerio 154 Met Ser Glu Thr Ala Ile Ser Phe Ala Lys Asp Phe Leu Ala Gly Gly 1 5 10 15 Ile Ala Ala Ala Ile Ser Lys Thr Ala Val Ala Pro Ile Glu Arg Val 20 25 30 Lys Leu Leu Leu Gln Val Gln His Ala Ser Lys Gln Ile Thr Ala Asp 35 40 45 Lys Gln Tyr Lys Gly Ile Met Asp Cys Val Val Arg Ile Pro Lys Glu 50 55 60 Gln Gly Phe Leu Ser Phe Trp Arg Gly Asn Leu Ala Asn Val Ile Arg 65 70 75 80 Tyr Phe Pro Thr Gln Ala Leu Asn Phe Ala Phe Lys Asp Lys Tyr Lys 85 90 95 Lys Val Phe Leu Asp Gly Val Asp Lys Arg Thr Gln Phe Trp Arg Tyr 100 105 110 Phe Ala Gly Asn Leu Ala Ser Gly Gly Ala Ala Gly Ala Thr Ser Leu 115 120 125 Cys Phe Val Tyr Pro Leu Asp Phe Ala Arg Thr Arg Leu Ala Ala Asp 130 135 140 Val Gly Lys Ala Gly Ala Glu Arg Glu Phe Ser Gly Leu Gly Asn Cys 145 150 155 160 Leu Val Lys Ile Ser Lys Ser Asp Gly Ile Lys Gly Leu Tyr Gln Gly 165 170 175 Phe Asn Val Ser Val Gln Gly Ile Ile Ile Tyr Arg Ala Ala Tyr Phe 180 185 190 Gly Ile Tyr Asp Thr Ala Lys Gly Met Leu Pro Asp Pro Lys Asn Thr 195 200 205 His Ile Val Val Ser Trp Met Ile Ala Gln Ser Val Thr Ala Val Ala 210 215 220 Gly Leu Ala Ser Tyr Pro Phe Asp Thr Val Arg Arg Arg Met Met Met 225 230 235 240 Gln Ser Gly Arg Lys Gly Ala Asp Ile Met Tyr Ser Gly Thr Ile Asp 245 250 255 Cys Trp Arg Lys Ile Ala Arg Asp Glu Gly Gly Lys Ala Phe Phe Lys 260 265 270 Gly Ala Trp Ser Asn Val Leu Arg Gly Met Gly Gly Ala Phe Val Leu 275 280 285 Val Leu Tyr Asp Glu Leu Lys Lys Val Ile 290 295 155 1416 DNA Danio rerio 155 gtgcagcgct aaaatacgcc gagcttccgc gagacacttc atcgaggcac agctatgttt 60 atgtgctcaa tttagctgtt ctctgataac aatcagtatt atttactaac taagagccaa 120 agatgacctc aaaaacaaaa gtgggaaagg tgggcactaa aagcaaagaa gatgctcctc 180 atgaactgga gagtcagttt attttgcggc ttcctcagga atacgcctct acggttagac 240 ggattgccca atctagcagc atgaacatga aagacaggct tacaatagag ttgcatgctg 300 atggacgcca cgggattgta cgcgtggatc gtgttccttt agcatgtaaa ttggtggatt 360 taccctgcat cctggagtca ttaaaaactg ttgacaaaaa gactttttat aagacggccg 420 atatctgtca gatgctggta tgcacactag atggagatct gtacccccct ctagaggagc 480 ctacaggcac tactgactcc aaaagcaaaa agaaagacaa ggacaaagac aagaaatttg 540 tttggaacca cggcattaca cttcctctga agaacacaag aaagaggcgg ttcaggaaga 600 cagcgaagaa gaagtacatc gagtctcctg atgtggaaaa agaggtgaag agactcttga 660 gcacagatgc agaagccgtc agcgttcgat gggaggtgat agctgaagat gaatctaaag 720 aacctgacaa cagcttatct ctttccaacc tggagtcttc acctggaacc tctggacaca 780 agggtcatgg ctcttcagtc caacatgatg agcttcgcga gatcttcaac gacatcagca 840 gcagcagtga ggatgaggac gaagagggcg atcgacatga tgatgaagat ctaaacatca 900 tggacactga agatgacatg gtcaggcagc tccacgagaa actgaacgag tcagacggag 960 gcagagatga gaatgaccgg aacagtcaga tcgtaatgga gtaccagatg cagatcaaca 1020 atttgaaggc caagctccag gaaacacgtg cccgcaagaa gcaacaggag aaactgatca 1080 tggaggtgga aaatcaagcc ctgagggatc gcttccaggg acttttgaat ggtatgattc 1140 gtcaagagga gcaagagatg gaacagctgg cttcccttca agagcagctg gactcgctga 1200 tcgagaagtg atgctcacca ttaccaagat tcagcaatac aagcacactg accagaggcg 1260 actgctgaat acatacacac atgtctgact tcttgtatta gctgaatgtg agttgtctaa 1320 atatgtgact gaccatctgg cctcatcata ctgtggattg gaaggatcaa tcctgcattc 1380 gttctttctt ttgaccttgt ttttagttac ttgttt 1416 156 362 PRT Danio rerio 156 Met Thr Ser Lys Thr Lys Val Gly Lys Val Gly Thr Lys Ser Lys Glu 1 5 10 15 Asp Ala Pro His Glu Leu Glu Ser Gln Phe Ile Leu Arg Leu Pro Gln 20 25 30 Glu Tyr Ala Ser Thr Val Arg Arg Ile Ala Gln Ser Ser Ser Met Asn 35 40 45 Met Lys Asp Arg Leu Thr Ile Glu Leu His Ala Asp Gly Arg His Gly 50 55 60 Ile Val Arg Val Asp Arg Val Pro Leu Ala Cys Lys Leu Val Asp Leu 65 70 75 80 Pro Cys Ile Leu Glu Ser Leu Lys Thr Val Asp Lys Lys Thr Phe Tyr 85 90 95 Lys Thr Ala Asp Ile Cys Gln Met Leu Val Cys Thr Leu Asp Gly Asp 100 105 110 Leu Tyr Pro Pro Leu Glu Glu Pro Thr Gly Thr Thr Asp Ser Lys Ser 115 120 125 Lys Lys Lys Asp Lys Asp Lys Asp Lys Lys Phe Val Trp Asn His Gly 130 135 140 Ile Thr Leu Pro Leu Lys Asn Thr Arg Lys Arg Arg Phe Arg Lys Thr 145 150 155 160 Ala Lys Lys Lys Tyr Ile Glu Ser Pro Asp Val Glu Lys Glu Val Lys 165 170 175 Arg Leu Leu Ser Thr Asp Ala Glu Ala Val Ser Val Arg Trp Glu Val 180 185 190 Ile Ala Glu Asp Glu Ser Lys Glu Pro Asp Asn Ser Leu Ser Leu Ser 195 200 205 Asn Leu Glu Ser Ser Pro Gly Thr Ser Gly His Lys Gly His Gly Ser 210 215 220 Ser Val Gln His Asp Glu Leu Arg Glu Ile Phe Asn Asp Ile Ser Ser 225 230 235 240 Ser Ser Glu Asp Glu Asp Glu Glu Gly Asp Arg His Asp Asp Glu Asp 245 250 255 Leu Asn Ile Met Asp Thr Glu Asp Asp Met Val Arg Gln Leu His Glu 260 265 270 Lys Leu Asn Glu Ser Asp Gly Gly Arg Asp Glu Asn Asp Arg Asn Ser 275 280 285 Gln Ile Val Met Glu Tyr Gln Met Gln Ile Asn Asn Leu Lys Ala Lys 290 295 300 Leu Gln Glu Thr Arg Ala Arg Lys Lys Gln Gln Glu Lys Leu Ile Met 305 310 315 320 Glu Val Glu Asn Gln Ala Leu Arg Asp Arg Phe Gln Gly Leu Leu Asn 325 330 335 Gly Met Ile Arg Gln Glu Glu Gln Glu Met Glu Gln Leu Ala Ser Leu 340 345 350 Gln Glu Gln Leu Asp Ser Leu Ile Glu Lys 355 360 157 3050 DNA Danio rerio 157 tttaaatgta gtgagggagg aaaagacctc tgctcccttt acgttagagc tgatagtctc 60 ttatctgagc atccgagaaa tactcccgtt aatcgcacat cattgccgac gtcctaatga 120 tagatagagg aaaacgattc aagctgtgag actatcagga aaaaggaaac atactaggac 180 attctgtggt agaagttctg aattagagac aaaaaattgt atttttttat ttttatacaa 240 acaggccagt ttaaaacgat taaaaatgac tagaagatct tgtactattt acgccacagg 300 gattgtgtgc gctcatttgc taattttggg aatcgccctt ctgctggctc aagtcttcca 360 aacaatgatc caagaacgga taaaaaagga gatcacgctg gctgagaaca gcagagtatt 420 ggatggatgg ataaatccac cacctcctgt ttacatgcag tatttcttct tcaatgttac 480 caatcccgat gagttcttgg cggggaagga aaaggccaaa gttactcaga tggggccata 540 tacctacagg gaatatcgac caagagaaaa tgtgacttat cttgagaatg ggacaaagat 600 ttttgccaca aatcccaaga gctttgtgtt tcttcgcaac atgtcagctg gagatcctga 660 ggttgaccgt gtgacaaccg ttaatattcc aatgattgca gtgatgaatg agttaaactc 720 ctactccttc tttgtgagga cagctgtatc tatgtatatg gggtccatgg gcatgggcct 780 gtttatgaat cgtacagttc atgagatcct gtggggtttc aaagatccac ttctgaccaa 840 actccacgcc atgagaccag aagtggatga gcattttggc ctcatgtaca ataaaaatgg 900 aacccatgaa ggtgaatttg tcttccacac tggagagaag aattacatga actatggcaa 960 gattgacacg tggaacggca tcagtcagat gaattggtgg tcatctaacc agagcaacat 1020 gatcaatggt actgatggca gtgtcttcca taccttcctg tccagaaaag agctcctcta 1080 catctttgct gctgatctct gccgatctat ccatttgggt tacgtgaggg acatggaagt 1140 gaagggcatc ccagccttcc gcttcgctcc tccttcagat gtactagctc ctccagatga 1200 gaacccggca aatgctggct tctgcgtgcc tgctggagac tgtctaggaa aaggagtcct 1260 caaagtcagc gtctgtcgtc aaggtgcacc cattgttgtg tcattccctc atttctacca 1320 agctgatgag aggtacatca atgccattga agggatgaat ccaaatgagg aggaacatga 1380 aacatacctc gacatcaacc cgaccacagg ggttcctatt cgagcttgta agagagctca 1440 actcaatatc atactgaaaa gagtgcgtgg cttccctaac acaaaatttc tcaatgagac 1500 tattttcccc attatgtacg tcaatgagac agcaactatt gatgatgagt ctgcagccca 1560 gatgaggatg ctcctgttaa ttgtaacagt ggtatccaac tttccagtca tcattttggc 1620 attgggggtg attttacttg tagtccttat attcctcgtc tgcagaaacc gccagagaaa 1680 gaatgaagta aaacgtattg attttactga agcttttcat tcttttgcta caacaaaaga 1740 tgaaactgct tatacccaag tgagcaacca agcagaggat tcaccagaga accgcaacaa 1800 tcagccattg aggaatggat catatattgc catgtcacca gtggaggcac aaaagtgttg 1860 atgaagatga ccgatgaact ttcgcctgag gacattatgt ctattctcac cattttttaa 1920 tgtgttttac ttttcatcgc agcagtcttc tttgggggcc agtcaccacc tttggctgag 1980 ggtcacgcat tttactaaag acttttttct tttgtgtttg tttgtttcct tgagcatggc 2040 cttctcaaac tgccatatca aagcacccaa ttcagttttg tagaaccaac tctgggtttc 2100 cagaggcttc cgttgcctat tcctcagagg cctgaaacac actcttgcct ttttgaaaac 2160 acaaatatgc attgcatgtt cacagagtcg tacaccctct gagcatgagt gttctggatg 2220 ggtggaaata ctggggcccg ctgtgctttt taaagtgttt caaatctctt ctcacactac 2280 tgtatactgg tcagcacagg cagaccttga actaacacat ctcttcatat gtcaaaccct 2340 ttgacaatga gcagacaaat gcaaactgtt gatttctcac ctctagagta ttagggagct 2400 aaaaggtcta atatgcttac aaaaactggg tcttatccag gacttttgtg tgtgtgtgtg 2460 tgtgtgtgtg tgtgaaagag agagattgca catatgagtg tgtttgtaat tatatgtgag 2520 tatttcagca agcatgtcgg tgcttgcatg cttttgtcac attgtggtga actgctctgt 2580 tggctcactg tttaaacgac acagtgtgac agcaagtgga aaaaaatagt gacagtcctt 2640 tccatttcac agtgtctgaa gtacattcat tgggatgtca gaaacatttc agctgttaca 2700 aaaggtcaga aagtcattgt attgcataaa tcgcttagta ttttataatg aacaaacaac 2760 actcacaagt gagacttgaa taattctgcc ttacaaatac ttaagaacat aatgcggttc 2820 tgctgcatgg cattggcagg tggaactaga tgaattgcca acgaaacacg tctgttctgt 2880 gatagtgaga gcacaaatat agtatgcaga tcccatggtc taatcgtagc agaaggggtt 2940 aaaggtcagc gagttttgag cagtaaagtt ttgcacaatt cttttctgtg caagtccaat 3000 ttataataac cctctgtggg tgaaagacct agcacttgaa tcactacttg 3050 158 531 PRT Danio rerio 158 Met Thr Arg Arg Ser Cys Thr Ile Tyr Ala Thr Gly Ile Val Cys Ala 1 5 10 15 His Leu Leu Ile Leu Gly Ile Ala Leu Leu Leu Ala Gln Val Phe Gln 20 25 30 Thr Met Ile Gln Glu Arg Ile Lys Lys Glu Ile Thr Leu Ala Glu Asn 35 40 45 Ser Arg Val Leu Asp Gly Trp Ile Asn Pro Pro Pro Pro Val Tyr Met 50 55 60 Gln Tyr Phe Phe Phe Asn Val Thr Asn Pro Asp Glu Phe Leu Ala Gly 65 70 75 80 Lys Glu Lys Ala Lys Val Thr Gln Met Gly Pro Tyr Thr Tyr Arg Glu 85 90 95 Tyr Arg Pro Arg Glu Asn Val Thr Tyr Leu Glu Asn Gly Thr Lys Ile 100 105 110 Phe Ala Thr Asn Pro Lys Ser Phe Val Phe Leu Arg Asn Met Ser Ala 115 120 125 Gly Asp Pro Glu Val Asp Arg Val Thr Thr Val Asn Ile Pro Met Ile 130 135 140 Ala Val Met Asn Glu Leu Asn Ser Tyr Ser Phe Phe Val Arg Thr Ala 145 150 155 160 Val Ser Met Tyr Met Gly Ser Met Gly Met Gly Leu Phe Met Asn Arg 165 170 175 Thr Val His Glu Ile Leu Trp Gly Phe Lys Asp Pro Leu Leu Thr Lys 180 185 190 Leu His Ala Met Arg Pro Glu Val Asp Glu His Phe Gly Leu Met Tyr 195 200 205 Asn Lys Asn Gly Thr His Glu Gly Glu Phe Val Phe His Thr Gly Glu 210 215 220 Lys Asn Tyr Met Asn Tyr Gly Lys Ile Asp Thr Trp Asn Gly Ile Ser 225 230 235 240 Gln Met Asn Trp Trp Ser Ser Asn Gln Ser Asn Met Ile Asn Gly Thr 245 250 255 Asp Gly Ser Val Phe His Thr Phe Leu Ser Arg Lys Glu Leu Leu Tyr 260 265 270 Ile Phe Ala Ala Asp Leu Cys Arg Ser Ile His Leu Gly Tyr Val Arg 275 280 285 Asp Met Glu Val Lys Gly Ile Pro Ala Phe Arg Phe Ala Pro Pro Ser 290 295 300 Asp Val Leu Ala Pro Pro Asp Glu Asn Pro Ala Asn Ala Gly Phe Cys 305 310 315 320 Val Pro Ala Gly Asp Cys Leu Gly Lys Gly Val Leu Lys Val Ser Val 325 330 335 Cys Arg Gln Gly Ala Pro Ile Val Val Ser Phe Pro His Phe Tyr Gln 340 345 350 Ala Asp Glu Arg Tyr Ile Asn Ala Ile Glu Gly Met Asn Pro Asn Glu 355 360 365 Glu Glu His Glu Thr Tyr Leu Asp Ile Asn Pro Thr Thr Gly Val Pro 370 375 380 Ile Arg Ala Cys Lys Arg Ala Gln Leu Asn Ile Ile Leu Lys Arg Val 385 390 395 400 Arg Gly Phe Pro Asn Thr Lys Phe Leu Asn Glu Thr Ile Phe Pro Ile 405 410 415 Met Tyr Val Asn Glu Thr Ala Thr Ile Asp Asp Glu Ser Ala Ala Gln 420 425 430 Met Arg Met Leu Leu Leu Ile Val Thr Val Val Ser Asn Phe Pro Val 435 440 445 Ile Ile Leu Ala Leu Gly Val Ile Leu Leu Val Val Leu Ile Phe Leu 450 455 460 Val Cys Arg Asn Arg Gln Arg Lys Asn Glu Val Lys Arg Ile Asp Phe 465 470 475 480 Thr Glu Ala Phe His Ser Phe Ala Thr Thr Lys Asp Glu Thr Ala Tyr 485 490 495 Thr Gln Val Ser Asn Gln Ala Glu Asp Ser Pro Glu Asn Arg Asn Asn 500 505 510 Gln Pro Leu Arg Asn Gly Ser Tyr Ile Ala Met Ser Pro Val Glu Ala 515 520 525 Gln Lys Cys 530 159 418 PRT Danio rerio 159 Met Met Arg Met Ser Trp Met Val Thr Val Ile Asn Arg Arg Met Met 1 5 10 15 Lys Ile Leu Ile Ala Leu Ala Leu Ile Ala Tyr Ile Ala Ser Val Trp 20 25 30 Gly Thr Tyr Ala Asn Met Arg Ser Ile Gln Glu His Gly Glu Met Lys 35 40 45 Ile Glu Gln Arg Ile Asp Glu Ala Val Gly Pro Leu Arg Glu Lys Ile 50 55 60 Arg Glu Leu Glu Leu Ser Phe Thr Gln Lys Tyr Pro Pro Val Lys Phe 65 70 75 80 Leu Ser Glu Lys Asp Arg Lys Arg Ile Leu Ile Thr Gly Gly Ala Gly 85 90 95 Phe Val Gly Ser His Leu Thr Asp Lys Leu Met Met Asp Gly His Glu 100 105 110 Val Thr Val Val Asp Asn Phe Phe Thr Gly Arg Lys Arg Asn Val Glu 115 120 125 His Trp Ile Gly His Glu Asn Phe Glu Leu Ile Asn His Asp Val Val 130 135 140 Glu Pro Leu Tyr Ile Glu Val Asp Gln Ile Tyr His Leu Ala Ser Pro 145 150 155 160 Ala Ser Pro Pro Asn Tyr Met Tyr Asn Pro Ile Lys Thr Leu Lys Thr 165 170 175 Asn Thr Ile Gly Thr Leu Asn Met Leu Gly Leu Ala Lys Arg Val Gly 180 185 190 Ala Arg Leu Leu Leu Ala Ser Thr Ser Glu Val Tyr Gly Asp Pro Glu 195 200 205 Val His Pro Gln Asn Glu Asp Tyr Trp Gly His Val Asn Pro Ile Gly 210 215 220 Pro Arg Ala Cys Tyr Asp Glu Gly Lys Arg Val Ala Glu Thr Met Cys 225 230 235 240 Tyr Ala Tyr Met Lys Gln Glu Gly Val Glu Val Arg Val Ala Arg Ile 245 250 255 Phe Asn Thr Phe Gly Ser Arg Met His Met Asn Asp Gly Arg Val Val 260 265 270 Ser Asn Phe Ile Leu Gln Ala Leu Gln Gly Glu Ala Leu Thr Val Tyr 275 280 285 Gly Ser Gly Ser Gln Thr Arg Ala Phe Gln Tyr Val Ser Asp Leu Val 290 295 300 Asn Gly Leu Val Ser Leu Met Asn Ser Asn Ile Ser Ser Pro Val Asn 305 310 315 320 Leu Gly Asn Pro Glu Glu His Thr Ile Leu Glu Phe Gly Ser Leu Ile 325 330 335 Lys Ser Leu Val Ala Ser Arg Ser His Ile Gln Phe Leu Ser Glu Ala 340 345 350 Gln Asp Asp Pro Gln Arg Arg Arg Thr Asp Ile Arg Arg Ala Lys Leu 355 360 365 Leu Leu Gly Trp Glu Pro Val Val Pro Leu Glu Glu Gly Leu Asn Lys 370 375 380 Thr Ile Gln Tyr Phe Ser Arg Glu Leu Glu His Gln Ala Asn Asn Gln 385 390 395 400 Tyr Ile Pro Lys Pro Lys Ala Ala Arg Met Lys Lys Gly Arg Pro Arg 405 410 415 His Asn 

1. An isolated nucleic acid molecule comprising a nucleic acid sequence having at least 75% nucleic acid sequence identity to the 459 nucleic acid sequence of SEQ ID NO:59 over at least 600 contiguous nucleic acids, wherein said nucleic acid molecule functions in kidney development.
 2. The isolated nucleic acid molecule of claim 1, wherein said nucleic acid molecule comprises the sequence of SEQ ID NO:59.
 3. The isolated nucleic acid molecule of claim 1, wherein said nucleic acid molecule is a human or mouse nucleic acid molecule.
 4. The nucleic acid molecule of claim 1, wherein said nucleic acid molecule is a zebrafish nucleic acid molecule.
 5. A vector comprising the isolated nucleic acid molecule of claim
 1. 6. A cell comprising the vector of claim
 5. 7. The isolated nucleic acid molecule of claim 1, wherein said nucleic acid molecule further comprises a viral insertion at a nucleotide corresponding to nucleotide 210 of SEQ ID NO:59.
 8. A zebrafish comprising the nucleic acid molecule of claim
 7. 9. An isolated polypeptide comprising an amino acid sequence having at least 75% sequence identity to the 459 amino acid sequence of SEQ ID NO:60 over at least 250 contiguous amino acids, wherein said polypeptide functions in kidney development.
 10. The isolated polypeptide of claim 9, wherein said polypeptide comprises the sequence of SEQ ID NO:60.
 11. A method of treating or preventing a kidney disorder in an organism, said method comprising the step of contacting said organism with a therapeutically effective amount of a nucleic acid comprising the nucleic acid of claim 1, or its complement, wherein the nucleic acid is sufficient to elicit an alteration in expression of a 459 nucleic acid sequence in said organism, and wherein said alteration in the level of expression treats or prevents a kidney disorder.
 12. The method of claim 11, wherein said nucleic acid is a cDNA or an mRNA molecule and said contacting results in an increase in expression of the polypeptide encoded by a nucleic acid sequence comprising the nucleic acid sequence of SEQ ID NO:59.
 13. The method of claim 11, wherein said nucleic acid is a double-stranded RNA molecule and said contacting leads to a decrease in expression, or inhibits biological activity, of a nucleic acid sequence comprising SEQ ID NO:59.
 14. The method of claim 11, wherein said nucleic acid is an anti-sense RNA molecule, and said contacting leads to a decrease in expression, or inhibits biological activity, of a nucleic acid sequence comprising SEQ ID NO:59.
 15. A method for diagnosing a kidney disorder or the propensity to develop a kidney disorder in an organism, said method comprising detecting an alteration in the level of 459 polypeptide expression in a sample derived from a first organism and comparing the level of expression to that of a 459 polypeptide in a sample derived from a second, control organism, wherein an alteration in the level of expression or activity of said 459 polypeptide in said first organism relative to said second organism is indicative of said first organism having or having a propensity to develop a kidney disorder.
 16. The method of claim 15, wherein said 459 polypeptide comprises the amino acid sequence of SEQ ID NO:60. 